(54) Error detection and error concealment for encoded speech data



(57) In a low bitrate speech encoding system, encoded bits are strongly protected against errors
produced on a transmission path. A decoding side checks for transmission errors using an error check
code appended to a convolution decoded output and adjusts the decoded output depending on the result
of the check. At this time, it is necessary to maintain continuity of the speech signals after speech
decoding. To this end, a convolution decoder 16 convolution decodes the convolution coded output from
the encoding device to provide a convolution decoded output of a crucial bit set with the appended
error check code and a bit set excluding the crucial bit set. A CRC code comparator-frame masking
unit 15 compares the CRC check code appended to the convolution decoded output from the convolution
decoder 16 to the CRC check code computed from the bit group excluding the crucial bit set to adjust
the convolution decoded output.
Description

[0001] This invention relates to a method and apparatus for decoding encoding parameters selectively
protected against errors produced in a transmission path, and a computer program for implementing the
method and apparatus.

[0002] There are known a variety of encoding methods for doing signal compression by exploiting
statistical properties in the time domain and in the frequency domain of audio signals, including
speech and acoustic signals, and psychoacoustic properties of the human being. Among these encoding
methods, the VSELP (Vector Sum Excited Linear Prediction) encoding system or the PSI-CELP (Pitch
Synchronous Innovation - CELP) encoding system, classified under the so-called CELP (Code Excited
Linear Prediction) encoding system, is attracting attention as a low bit rate speech encoding system.

[0003] In the waveform encoding system, such as this CELP encoding system, input speech signals are
formed into blocks or frames, with a pre-set number of samples of the input speech signals as an
encoding unit, and a closed loop search of an optimum vector is performed using an analysis by
synthesis method on the block- or frame-based time-domain speech waveform to execute vector
quantization of the waveform to output the vector index.

[0004] Meanwhile, the code bits, obtained with the speech encoding system with a low bitrate, such as
2 kbps or 4 kbps, are widely used in communication, computers or broadcasting as general audio not
constrained by specified audio information contents. Therefore, these code bits need to be protected
strongly against errors produced on the transmission path.

[0005] If errors are produced in succession on the transmission path, sound dropout persists for a
long time in speech decoding, thus lowering the speech quality.

[0006] The present inventors have disclosed, in Japanese Laying-Open Patent H-11-122120, an encoding
method and apparatus and a decoding method and apparatus which are strong against errors occurring on
the transmission path, to enable the sound quality to be improved appreciably.

[0007] In this encoding method and apparatus, the input speech signal is divided on the time axis
every pre-set encoding unit and encoded from one encoding unit to another to output plural sorts of
encoded speech parameters. Of these plural sorts of encoded speech parameters, a psychoacoustically
crucial bit group is selected and, from this crucial bit group, an error check code is generated.
This error check code and the crucial bit group are subjected to convolution coding to enable
protection of the psychoacoustically crucial bit group from errors occurring on the transmission
path.

[0008] The transmitted encoded data are produced on convolution encoding of the error check code and
the crucial bit group and on merging the result with the bit group excluding the crucial bit group.
For decoding these data, the decoding method and apparatus process the convolution coded output with
convolution decoding to output, as a convolution decoded output, the crucial bit group with the
appended error check code and the bit group excluding the crucial bit group. The decoding method and
apparatus then check for transmission errors using the error check code appended to the convolution
decoded output, adjust the convolution decoded output depending on the result of the error check and
process the adjusted convolution decoded output with speech decoding, thus enabling decoding of the
speech such as to suppress the lowering of the sound quality due to errors occurring on transmission
paths.

[0009] Meanwhile, in the above-described decoding method and apparatus, it is necessary to maintain
signal continuity of the speech signals following the speech decoding when adjusting the decoded
output responsive to the results of error detection as to transmission errors using the error check
code appended to the convolution decoded output.

[0010] It would be desirable to provide a decoding method and apparatus and a program furnishing
medium whereby signal continuity of the speech signals may be maintained, so far as is possible, to
enable decoding of high quality speech.

[0011] In one aspect, the present invention provides a decoding apparatus for decoding transmitted
encoded parameters corresponding to input speech signals which have been encoded by an encoding
device from one pre-set encoding unit on the time axis to another and which have been classed
depending on the psychoacoustic sensitivity of bits to transmission path errors, wherein the decoding
apparatus includes transmission path decoding means for detecting an error using an error check code
appended to the encoding parameters of a pre-set class and providing for different frame masking
processing depending on errors in the encoding parameters produced in the encoding unit.

[0012] Preferably, the encoding processing in the encoding device is performed using a plurality of
vector quantizers, and an index representing each code vector is classified according to
psychoacoustic sensitivity of bits against transmission path errors and is transmitted in this
classified state. The transmission path decoding means detects the error check codes appended to the
index of a pre-set class to provide for different frame masking processing depending on an index
error produced in the encoding unit.

[0013] Preferably, the encoding processing in the encoding device is performed using a multi-stage
vector quantizer prepared by alternate learning, and an index representing each code vector is
classified according to psychoacoustic sensitivity of bits against transmission path errors and is
transmitted in this classified state. The transmission path decoding means detects the error check
codes appended to the index of a pre-set class to provide for different frame masking processing
depending on whether or not an error has been detected in plural indexes.
[0014] In another aspect, the present invention provides a decoding method for decoding transmitted
encoded parameters corresponding to input speech signals which have been encoded by an encoding
device from one pre-set encoding unit on the time axis to another and which have been classed
depending on the psychoacoustic sensitivity of bits to transmission path errors, wherein the decoding
method includes a transmission path decoding step of detecting an error using an error check code
appended to the encoding parameters of a pre-set class and providing for different frame masking
processing depending on errors in the encoding parameters produced in the encoding unit.
[0015] Preferably, the encoding processing in the encoding device is performed using a plurality of
vector quantizers, and an index representing each code vector is classified according to
psychoacoustic sensitivity of bits against transmission path errors and is transmitted in this
classified state. The transmission path decoding step detects the error check codes appended to the
index of a pre-set class to provide for different frame masking processing depending on an index
error produced in the encoding unit.

[0016] Preferably, the encoding processing in the encoding device is performed using a multi-stage
vector quantizer prepared by alternate learning, and an index representing each code vector is
classified according to psychoacoustic sensitivity of bits against transmission path errors and is
transmitted in this classified state. The transmission path decoding means detects the error check
codes appended to the index of a pre-set class to provide for different frame masking processing
depending on whether or not an error has been detected in plural indexes.
[0017] In still another aspect, the present invention provides a program furnishing medium for
furnishing a program for decoding transmitted encoded parameters corresponding to input speech
signals which have been encoded by an encoding device from one pre-set encoding unit on the time axis
to another and which have been classed depending on the psychoacoustic sensitivity of bits to
transmission path errors, wherein the program includes a transmission path decoding step of detecting
an error using an error check code appended to the encoding parameters of a pre-set class and
providing for different frame masking processing depending on errors in the encoding parameters
produced in the encoding unit.

[0018] Preferably, the encoding processing in the encoding device is performed using a plurality of
vector quantizers, and an index representing each code vector is classified according to
psychoacoustic sensitivity of bits against transmission path errors and is transmitted in this
classified state, wherein the transmission path decoding step detects the error check codes appended
to the index of a pre-set class to provide for different frame masking processing depending on an
index error produced in the encoding unit.

[0019] Preferably, the encoding processing in the encoding device is performed using a multi-stage
vector quantizer prepared by alternate learning, and an index representing each code vector is
classified according to psychoacoustic sensitivity of bits against transmission path errors and is
transmitted in this classified state. The transmission path decoding means detects the error check
codes appended to the index of a pre-set class to provide for different frame masking processing
depending on whether or not an error has been detected in plural indexes.

[0020] In the decoding method and apparatus of the present invention, it is possible to keep
continuity of the speech signals to decode speech of high sound quality.

[0021] With the use of the program furnishing medium of the present invention, it is possible to
maintain continuity of speech signals in a computer system to decode speech of high sound quality.

[0022] Preferred embodiments of the present invention will be explained in detail, by way of
non-limitative example, with reference to the accompanying drawings, in which:

Fig. 1 is a block diagram showing the structure of a portable telephone device embodying the present invention.
Fig. 2 is a block diagram showing a basic structure of a speech encoder constituting the portable telephone device.
Fig. 3 is a block diagram showing a detailed structure of the speech encoder.
Fig. 4 shows parameters common to bitrates of 2 and 4 kbps.
Fig. 5 shows parameters proper only to 4 kbps.
Fig. 6 shows class division of voiced speech parameters with the bitrate of 2 kbps.
Fig. 7 shows class division of unvoiced sound parameters with the bitrate of 2 kbps.
Fig. 8 shows class division of voiced speech parameters with the bitrate of 4 kbps.
Fig. 9 shows class division of unvoiced sound parameters with the bitrate of 4 kbps.
Fig. 10 shows an inputting sequence of voiced speech parameters to a channel coder with the bitrate of 2 kbps.
Fig. 11 shows an inputting sequence of unvoiced sound parameters to a channel coder with the bitrate of 2 kbps.
Fig. 12 shows an inputting sequence of voiced speech parameters to a class I bit channel coder with the bitrate of 4 kbps.
Fig. 13 shows an inputting sequence of voiced speech parameters to a class II to class V bit channel coder with the bitrate of 4 kbps.
Fig. 14 shows an inputting sequence of voiced speech parameters to a class VI and class VII bit channel coder with the bitrate of 4 kbps.
Fig. 15 shows an inputting sequence of unvoiced sound parameters to a class I bit channel coder with the bitrate of 4 kbps.
Fig. 16 shows an inputting sequence of unvoiced sound parameters to a class II to class V bit channel coder with the bitrate of 4 kbps.
Fig. 17 shows an inputting sequence of unvoiced sound parameters to a class VI and class VII bit channel coder with the bitrate of 4 kbps.
Fig. 18 shows bit assignment of the class I to VI of the voiced speech of 2 kbps and the class II to VII of the voiced sound of 4 kbps.
Fig. 19 shows state transition for illustrating the operation of a transmission path decoder constituting the portable telephone device.
Fig. 20 shows the values of interpolation coefficients P used in the equation (7).
Fig. 21 illustrates the setting of the mute variable mute used for controlling the sound volume of an output speech responsive to the value of the state variable state.
Fig. 22 shows values of s[i] used in the equation (10).
Fig. 23 is a block diagram showing a basic structure of the speech decoder.
Fig. 24 is a block diagram showing a detailed structure of the speech decoder.
Fig. 25 is a flowchart for illustrating the encoding method executed by the portable telephone device.
Fig. 26 is a flowchart for illustrating the decoding method executed by the portable telephone device.
Fig. 27 is a block diagram of a transmission system to which the present invention can be applied.
Fig. 28 is a block diagram of a server constituting the above transmission system.
Fig. 29 is a block diagram showing a client terminal constituting the transmission system.

[0023] The illustrated embodiment is directed to a portable telephone device having a decoding device
as a specified example of the decoding method and apparatus of the present invention, as shown in
Fig. 1. This portable telephone device also includes an encoding device for encoding the speech with
a low bitrate of, for example, 2 or 4 kbps.
[0024] The encoding device splits an input speech signal every encoding unit on the time axis and
encodes the speech signal from one encoding unit to another to generate plural sorts of speech
encoding parameters. The bits are allotted to plural protective classes, depending on the
psychoacoustic sensitivity of the bits of the speech encoding parameters to transmission path errors.
For example, if the bitrate is 2 kbps or 4 kbps, the bits are allocated to 6 and 7 classes,
respectively. The class allocation and speech encoding parameters will be explained in detail
subsequently.
[0025] First, the encoding device of the present portable telephone device includes a speech encoder
3, a class division and inputting sequence decision unit 23 and a transmission path encoder 4. The
speech encoder 3 splits the input speech signal on the time axis in terms of a pre-set encoding unit
and encodes the input signal every encoding unit to output plural sorts of speech encoding
parameters. The class division and inputting sequence decision unit 23 classifies the plural sorts of
speech encoding parameters from the speech encoder 3 into 6 and 7 classes, if the bitrate is 2 kbps
or 4 kbps, respectively, depending on the psychoacoustic sensitivity of the bits to transmission path
errors, while determining the bit inputting sequence to the transmission path encoder 4 of the next
stage. The transmission path encoder 4 generates CRC (cyclic redundancy check) codes, depending on
the classes obtained by the class division and inputting sequence decision unit 23 and on the bit
inputting sequence, and applies the CRC codes to the bits. Depending on the class, the transmission
path encoder 4 applies convolution coding to the bits in addition to the CRC codes by way of
protection, or performs no protection at all.

[0026] The class division and inputting sequence decision unit 23 includes a bit controller 23a and a
RAM 23b. The bit controller 23a classifies the plural sorts of speech encoding parameters from the
speech encoder 3 depending on the psychoacoustic sensitivity of the bits to transmission path errors,
using the RAM 23b as a work area, while determining the bit inputting sequence.

[0027] The transmission path encoder 4 includes a CRC code calculating unit 5 and a convolution
encoder 6. The CRC code calculating unit 5 generates the CRC (cyclic redundancy check) code.
Depending on the class, the convolution encoder 6 applies the convolution coding to the bit group to
which the CRC code has been appended in the CRC code calculating unit 5, or the transmission path
encoder 4 simply outputs the bit group to which the CRC code from the CRC code calculating unit 5 has
been appended. The transmission path encoder 4 may also transmit the bit group without doing any
processing thereon, depending on the class.
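
The per-class processing described above (CRC generation, optional convolution coding, or no
processing) can be sketched as follows. This is only an illustrative sketch: the generator polynomial
x^3 + x + 1, the rate-1/2 convolutional code with constraint length 3 and generators 7 and 5 (octal),
and the function names are assumptions made for the example and are not taken from this document.

```python
from typing import List

def crc_bits(bits: List[int], poly: List[int]) -> List[int]:
    """CRC of `bits`: remainder of bits * x^r divided by `poly` over GF(2).

    `poly` lists the generator coefficients, highest power first (assumed).
    """
    r = len(poly) - 1
    work = list(bits) + [0] * r
    for i in range(len(bits)):
        if work[i]:
            for j, p in enumerate(poly):
                work[i + j] ^= p
    return work[-r:]

def conv_encode(bits: List[int]) -> List[int]:
    """Rate-1/2 convolutional encoder, generators 111 and 101 (assumed)."""
    s1 = s2 = 0
    out: List[int] = []
    for b in list(bits) + [0, 0]:   # two tail bits return the encoder to state 0
        out.append(b ^ s1 ^ s2)     # generator 111
        out.append(b ^ s2)          # generator 101
        s1, s2 = b, s1
    return out

def protect_class(bits: List[int], poly: List[int],
                  use_crc: bool, use_conv: bool) -> List[int]:
    """Apply CRC and/or convolution coding to one class of bits."""
    out = list(bits)
    if use_crc:
        out = out + crc_bits(out, poly)
    if use_conv:
        out = conv_encode(out)
    return out
```

Calling protect_class with both flags off returns the bits unchanged, which corresponds to the case
where a class is transmitted without any processing.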

[0028] In the portable telephone device, the decoding device, employing the decoding method and
apparatus according to the present invention, includes a convolution decoder 16, adapted for
convolution decoding the crucial bit group, having the error correction code simply appended thereto,
and the bit group excluding the crucial bit group, and for outputting the result of the convolution
decoding, and a CRC code comparator-frame masking unit 15 for comparing the CRC code appended to the
convolution decoded output of the convolution decoder 16 to the CRC code calculated from the bit
group excluding the crucial bit group to adjust the convolution decoded output. The decoding device
also includes a speech decoder 17 for speech decoding the convolution decoded output of the CRC code
comparator-frame masking unit 15.
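
The comparison performed on the decoding side can be sketched as follows. The sketch recomputes a CRC
over the received bits, compares it with the received CRC and, on a mismatch, substitutes the bits of
the last error-free frame. Both the generator polynomial and the substitution rule are assumptions
chosen for illustration; the paragraph above states only that the decoded output is adjusted.

```python
from typing import List, Optional, Tuple

def crc_bits(bits: List[int], poly: List[int]) -> List[int]:
    """Remainder of bits * x^r divided by `poly` over GF(2), r = len(poly) - 1."""
    r = len(poly) - 1
    work = list(bits) + [0] * r
    for i in range(len(bits)):
        if work[i]:
            for j, p in enumerate(poly):
                work[i + j] ^= p
    return work[-r:]

def check_and_mask(payload: List[int], received_crc: List[int], poly: List[int],
                   previous_good: Optional[List[int]]) -> Tuple[List[int], bool]:
    """Return (bits passed on to the speech decoder, crc_ok).

    On a CRC mismatch the previous error-free payload is reused, if there is
    one (hypothetical masking rule, not the rule of this document).
    """
    crc_ok = crc_bits(payload, poly) == list(received_crc)
    if crc_ok or previous_good is None:
        return list(payload), crc_ok
    return list(previous_good), False
```

Reusing earlier parameters is one simple way of avoiding an abrupt discontinuity in the decoded
speech when a frame is found to be in error.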

[0029] In the present portable telephone device, the speech signals, inputted during transmission at
a microphone 1, are converted by an A/D converter 2 into digital signals, which are then encoded by
the speech encoder 3 into encoded parameters of a low bitrate of 2 kbps/4 kbps. These encoded
parameters are processed by the class division and inputting sequence decision unit 23 for
classification and inputting sequence decision. The parameters from the class division and inputting
sequence decision unit 23 are routed to the transmission path encoder 4 where the parameters are
encoded such that the quality of the transmission path is less liable to affect the speech quality.
The encoded parameters are then modulated by a modulator 7 and thereby processed into output bits
which are transmitted through an antenna co-user 9 from an antenna 10.

[0030] During reception, the electrical waves, captured by the antenna 10, are received through the
antenna co-user 9 by a receiver 11 and demodulated by a demodulator 13 so as to be then corrected for
transmission errors by the speech decoder 17 and converted by a D/A converter 18 back into analog
speech signals which are outputted at a speaker 19.

[0031] A controller 20 controls the above-mentioned various portions, whilst a synthesizer 12 imparts
the transmission/reception frequency to a transmitter 8 and the receiver 11. A key-pad 21 and an LCD
indicator 22 are utilized as a man-machine interface.

[0032] In the above-described portable telephone device, the CRC code calculating unit 5,
constituting the transmission path encoder 4, selects part or all of the line spectrum pair (LSP)
parameters, generally defining the shape of the frequency spectrum of the speech signals, classified
and having their inputting sequence determined in the class division and inputting sequence decision
unit 23; all of the voiced (V)/unvoiced (UV) decision parameters, indicating whether the speech
signals are voiced (V) or unvoiced (UV); part or all of the pitch parameters when the speech signals
are voiced sound; part or all of the spectrum codebook index and gain index indicating the spectrum
envelope of residual signals of linear prediction coding (LPC) when the speech signals are voiced
sound; and part or all of the noise codebook index and the gain codebook index of residual signals of
linear prediction coding (LPC) when the speech signals are unvoiced sound, to generate CRC codes
therefrom.

[0033] These speech encoding parameters are acquired in the speech encoder 3. The speech encoding
method, carried out by the speech encoder 3, includes a short-term prediction residue calculating
step of finding short-term prediction residuals of the input speech signal, a sine wave analysis
encoding step of sine wave analyzing and encoding the calculated short-term prediction residuals, and
a waveform encoding step of encoding the input speech signal by waveform encoding. This speech
encoder 3 is now explained with reference to Figs. 2 and 3.

[0034] The basic concept of the speech encoder 3 of Fig. 2 resides in employing a first encoding unit
110 for finding short-term prediction errors, such as LPC (linear prediction coding) residuals, of
the input speech signal, to execute sinusoidal analysis encoding, such as harmonic coding, and a
second encoding unit 120 for encoding the input speech signal by waveform coding exhibiting phase
reproducibility with respect to the input speech signal, and in employing the first encoding unit 110
and the second encoding unit 120 for encoding the voiced (V) portion and the unvoiced (UV) portion of
the input speech signal, respectively.
[0035] The first encoding unit 110 employs a configuration of performing sinusoidal analysis
encoding, such as harmonic encoding or multi-band excitation (MBE) encoding, on the LPC residuals.
The second encoding unit 120 uses the configuration of the code excited linear prediction (CELP)
coding employing the vector quantization by a closed loop search of an optimum vector by, e.g., an
analysis by synthesis method.

[0036] In the example of Fig. 2, the speech signal sent to an input terminal 101 is routed to an LPC
inverse filter 111 and an LPC analysis quantization unit 113 of the first encoding unit 110. The LPC
coefficients, or so-called α-parameters, acquired from the LPC analysis quantization unit 113, are
sent to the LPC inverse filter 111, from which linear prediction residuals (LPC residuals) of the
input speech signal are recovered. From the LPC analysis quantization unit 113, a quantized output of
the LSP (line spectrum pair) is taken out and sent to an output terminal 102. The LPC residuals from
the LPC inverse filter 111 are sent to a sinusoidal analysis encoding unit 114, adapted for executing
pitch detection, calculation of the spectral envelope amplitudes and V/UV decision by a V/UV
(voiced/unvoiced) decision unit 115. The spectral envelope amplitude data from the sinusoidal
analysis encoding unit 114 are sent to a vector quantization unit 116. The codebook index from the
vector quantization unit 116, as a vector quantization output of the spectral envelope, is sent via a
switch 117 to the output terminal 103. An output of the sinusoidal analysis encoding unit 114 is sent
via a switch 118 to the output terminal 104. A V/UV decision output from the V/UV decision unit 115
is sent to the output terminal 105, while also being sent to the switches 117, 118 as control signals
therefor, so that, if the input speech signal is voiced (V), the index and the pitch are selected and
taken out at the output terminals 103, 104, respectively.
[0037] The second encoding unit 120 of Fig. 2 has a code excited linear prediction (CELP) coding
structure and executes vector quantization of the time domain waveform by a closed loop search method
consisting in synthesizing an output of a noise codebook 121 by a weighted synthesis filter 122,
routing the resulting weighted speech to a subtractor 123 to take out an error thereof from the
speech supplied to the input terminal 101 and passed through a psychoacoustic weighting filter 125,
and in sending the error to a distance calculating circuit 124 to calculate the distance and
searching for a vector which will minimize the error. The CELP encoding is used for encoding the
unvoiced speech portion. The codebook index, as the UV data from the CELP encoding, is taken out at
an output terminal 107 through a switch 127 which is turned on when the result of V/UV decision from
the V/UV decision unit 115 indicates unvoiced (UV) sound.
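
The closed loop search described above can be illustrated by the following simplified sketch. Each
codebook vector is passed through an all-pole synthesis filter, an optimum gain is computed, and the
vector giving the smallest squared error against the target is selected. The sketch assumes that the
target has already been psychoacoustically weighted and that the filter coefficients already include
the weighting; the function names and the per-vector gain computation are assumptions of the example.

```python
from typing import List, Sequence, Tuple

def synthesize(excitation: Sequence[float], a: Sequence[float]) -> List[float]:
    """All-pole filter 1/A(z), A(z) = 1 + a[0] z^-1 + a[1] z^-2 + ..., zero initial state."""
    out: List[float] = []
    for n, x in enumerate(excitation):
        feedback = sum(a[k] * out[n - 1 - k] for k in range(len(a)) if n - 1 - k >= 0)
        out.append(x - feedback)
    return out

def search_codebook(target: Sequence[float], codebook: Sequence[Sequence[float]],
                    a: Sequence[float]) -> Tuple[int, float, float]:
    """Return (index, gain, squared error) of the best codebook vector."""
    best_index, best_gain, best_err = -1, 0.0, float("inf")
    for idx, code in enumerate(codebook):
        synth = synthesize(code, a)
        energy = sum(s * s for s in synth)
        if energy == 0.0:
            continue  # an all-zero vector cannot match any target
        gain = sum(t * s for t, s in zip(target, synth)) / energy
        err = sum((t - gain * s) ** 2 for t, s in zip(target, synth))
        if err < best_err:
            best_index, best_gain, best_err = idx, gain, err
    return best_index, best_gain, best_err
```

Only the selected index (and, in this sketch, the gain) would need to be transmitted, since the
decoder can regenerate the waveform from the same codebook.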
[0038] Fig. 3 shows a more detailed structure of the speech encoder 3. In Fig. 3, parts or components
corresponding to those of Fig. 2 are indicated by the same reference numerals.

[0039] In the speech encoder 3, shown in Fig. 3, the speech signal supplied to the input terminal 101
is filtered by a high-pass filter (HPF) 109 to remove signals of an unneeded frequency range. The
filtered output is sent to an LPC analysis circuit 132 of an LPC (linear prediction coding) analysis
quantization unit 113 and to an LPC back-filtering circuit 111.

[0040] The LPC analysis circuit 132 of the LPC analysis quantization unit 113 applies the Hamming
window, with a length of the input signal waveform on the order of 256 samples as a block, to find
linear prediction coefficients, that is so-called α-parameters, by an autocorrelation method. The
framing interval as a data outputting unit is on the order of 160 samples. With the sampling
frequency fs of, for example, 8 kHz, the frame interval is 160 samples or 20 msec. The α-parameter
from the LPC analysis circuit 132 is sent to an α→LSP conversion circuit 133 for conversion to line
spectrum pair (LSP) parameters. This converts the α-parameters, found as straight type filter
coefficients, into, e.g., ten, that is five pairs, of LSP parameters. The conversion is by, e.g., the
Newton-Raphson method. The reason for doing conversion into LSP parameters is that the LSP parameters
are superior in interpolation characteristics to the α-parameters.
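
The analysis step described above can be sketched as follows: a block is weighted by a Hamming
window, its autocorrelation is computed, and the α-parameters are obtained from the autocorrelation.
The document names only the autocorrelation method; solving the resulting equations by the
Levinson-Durbin recursion, the sign convention A(z) = 1 + a1 z^-1 + ... and the function names are
assumptions of this sketch. The last lines reproduce the frame interval arithmetic.

```python
import math
from typing import List, Sequence

def hamming(n_samples: int) -> List[float]:
    """Hamming window of length n_samples."""
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * n / (n_samples - 1))
            for n in range(n_samples)]

def autocorrelation(x: Sequence[float], max_lag: int) -> List[float]:
    """Autocorrelation r[0..max_lag] of x."""
    return [sum(x[n] * x[n - lag] for n in range(lag, len(x)))
            for lag in range(max_lag + 1)]

def levinson_durbin(r: Sequence[float], order: int) -> List[float]:
    """Coefficients a[1..order] of A(z) = 1 + sum_k a[k] z^-k."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k                  # remaining prediction error energy
    return a[1:]

def lpc_from_block(block: Sequence[float], order: int = 10) -> List[float]:
    """Hamming-windowed autocorrelation-method LPC analysis of one block."""
    windowed = [s * w for s, w in zip(block, hamming(len(block)))]
    return levinson_durbin(autocorrelation(windowed, order), order)

SAMPLING_RATE_HZ = 8000
FRAME_SAMPLES = 160
FRAME_MSEC = 1000.0 * FRAME_SAMPLES / SAMPLING_RATE_HZ   # 160 / 8000 s = 20 msec
```

With a block of 256 samples and a frame interval of 160 samples, successive analysis blocks overlap
by 96 samples.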
[0041] The LSP parameters from the α→LSP conversion circuit 133 are matrix- or vector-quantized by
the LSP quantizer 134. In this case, the interframe difference is taken prior to vector quantization,
or plural frames are taken together and subjected to matrix quantization. Here, two frames of LSP
parameters, calculated every 20 msec, with a frame being 20 msec, are taken together and subjected to
matrix quantization and to vector quantization.
[0042] A quantized output of the LSP quantizer 134, that is the index of LSP quantization, is taken
out at a terminal 102, while the quantized LSP vector is sent to an LSP interpolation circuit 136.

[0043] The LSP interpolation circuit 136 interpolates the LSP vector, quantized every 20 msec or
every 40 msec, to raise the sampling rate by a factor of eight, so that the LSP vector will be
updated every 2.5 msec. The reason is that, if the residual waveform is analysis-synthesized by the
harmonic encoding/decoding method, the envelope of the synthesized waveform is extremely smooth, such
that, if the LPC coefficients are changed extremely rapidly, extraneous sounds tend to be produced.
That is, if the LPC coefficients are changed only gradually every 2.5 msec, such extraneous sound can
be prevented from being produced.
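
The eight-fold rate increase can be illustrated with a short sketch in which the LSP vector of the
preceding 20 msec frame and that of the current frame are joined by eight equally spaced intermediate
vectors, one for each 2.5 msec. Linear interpolation is an assumption of this sketch; the paragraph
above states only the interpolation factor and the resulting update interval.

```python
from typing import List, Sequence

def interpolate_lsp(prev: Sequence[float], curr: Sequence[float],
                    factor: int = 8) -> List[List[float]]:
    """Return `factor` LSP vectors moving from `prev` towards `curr`.

    The last returned vector equals `curr`, so that consecutive frames join
    without a jump (linear interpolation is assumed here).
    """
    out: List[List[float]] = []
    for step in range(1, factor + 1):
        w = step / factor
        out.append([(1.0 - w) * p + w * c for p, c in zip(prev, curr)])
    return out

FRAME_MSEC = 20.0
UPDATE_MSEC = FRAME_MSEC / 8   # 2.5 msec between successive LSP updates
```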

[0044] For executing the back-filtering of the input speech using the interpolated 2.5 msec based LSP
vector, the LSP parameter is converted by an LSP→α conversion circuit 137 into α-parameters which are
coefficients of a straight type filter with an order on the order of ten. An output of the LSP→α
conversion circuit 137 is sent to the LPC back-filtering circuit 111 where back-filtering is carried
out with the α-parameter updated every 2.5 msec to realize a smooth output. An output of the LPC
back-filtering circuit 111 is sent to an orthogonal conversion circuit 145, such as a discrete
Fourier transform circuit, of the sinusoidal analysis encoding unit 114, specifically, a harmonic
encoding circuit.

[0045] The α-parameter from the LPC analysis circuit 132 of the LPC analysis quantization unit 113 is
sent to a psychoacoustic weighting filter calculating circuit 139 where data for psychoacoustic
sensitivity is found. This weighted data is sent to the psychoacoustically weighted vector
quantization unit 116, the psychoacoustic weighting filter 125 of the second encoding unit 120 and to
the psychoacoustically weighted synthesis filter 122.

[0046] In the sinusoidal analysis encoding unit 114, such as the harmonic encoding circuit, an output
of the LPC back-filtering circuit 111 is analyzed by a harmonic encoding method. That is, the
sinusoidal analysis encoding unit 114 detects the pitch, calculates the amplitude Am of each harmonic
and performs V/UV discrimination. The sinusoidal analysis encoding unit 114 also dimensionally
converts the number of the amplitudes Am or the envelope of harmonics, which is changed with the
pitch, into a constant number.

[0047] In a specified example of the sinusoidal analysis encoding unit 114, shown in Fig. 3, routine
harmonic encoding is presupposed. In particular, in multi-band excitation (MBE) encoding, modelling
is performed on the assumption that a voiced portion and an unvoiced portion are present in each
frequency range or band at a concurrent time, that is in the same block or frame. In other forms of
harmonic coding, an alternative decision is made as to whether the speech in a block or frame is
voiced or unvoiced. In the following explanation, V/UV on the frame basis means the V/UV of a given
frame when the entire band is UV in case the MBE coding is applied. Meanwhile, the Japanese
Laying-Open Patent H-5-265487, proposed by the present Assignee, discloses a specified example of the
synthesis by analysis method of MBE.

[0048] An open-loop pitch search unit 141 of the sinusoidal analysis encoding unit 114 of Fig.3 is fed with an input
speech signal from the input terminal 101, while a zero-crossing counter 142 is fed with a signal from a high-pass filter
(HPF) 109. The orthogonal conversion circuit 145 of the sinusoidal analysis encoding unit 114 is fed with LPC residuals
or linear prediction residuals from the LPC back-filtering circuit 111. The open-loop pitch search unit 141 performs a
relatively rough pitch search by taking LPC residuals of the input signal. The extracted rough pitch data is sent to a
high-precision pitch search unit 146 where high-precision pitch search by the closed loop as later explained (fine pitch
search) is performed. From the open-loop pitch search unit 141, the maximum normalized autocorrelation value r(p),
obtained on normalizing the maximum value of the autocorrelation of the LPC residuals, is taken out along with the rough
pitch data and sent to the V/UV decision unit 115.
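
The rough search described above can be illustrated with a short sketch. It is a minimal illustration under stated assumptions, not the circuit of the open-loop pitch search unit 141: the function name `open_loop_pitch`, the lag range and the normalization by the block energy are assumptions made here, and a synthetic pulse train stands in for the LPC residuals.

```python
def open_loop_pitch(residual, min_lag, max_lag):
    """Return (rough_pitch_lag, r_p) for one block of residual samples.

    r_p is the largest autocorrelation value found in the lag range,
    normalized by the zero-lag autocorrelation (the block energy).
    The normalization choice is an assumption of this sketch.
    """
    energy = sum(x * x for x in residual)
    if energy == 0.0:
        return min_lag, 0.0
    best_lag, best_acf = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Autocorrelation at this lag over the overlapping part of the block.
        acf = sum(residual[n] * residual[n - lag] for n in range(lag, len(residual)))
        if acf > best_acf:
            best_lag, best_acf = lag, acf
    return best_lag, best_acf / energy


# Hypothetical input: a pulse every 40 samples, loosely resembling a voiced residual.
block = [1.0 if n % 40 == 0 else 0.0 for n in range(256)]
lag, r_p = open_loop_pitch(block, min_lag=20, max_lag=147)
```

For this pulse train the search settles on a lag of 40 samples, and r(p) stays below 1 because fewer pulses overlap at a non-zero lag than at lag zero.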

[0049] The orthogonal conversion circuit 145 performs orthogonal transform processing, such as discrete Fourier
transform (DFT), to transform LPC residuals on the time axis into spectral amplitude data. An output of the orthogonal
conversion circuit 145 is sent to the high-precision pitch search unit 146 and to a spectrum evaluation unit 148.
[0050] The high-precision (fine) pitch search unit 146 is fed with rough pitch data of a relatively rough pitch
extracted by the open-loop pitch search unit 141 and with data on the frequency domain from the orthogonal conversion
circuit 145. In this high-precision pitch search unit 146, pitch data are swung by ± several samples, with the
rough pitch data value as center, to approach to values of fine pitch data having an optimum decimal point (floating). As
the fine search technique, the so-called analysis-by-synthesis method is used and the pitch is selected so that the
synthesized power spectrum will be closest to the power spectrum of the original speech. The pitch data from the
high-precision pitch search unit 146 by the closed loop is sent through switch 118 to the output terminal 104.
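
The closed-loop refinement can be sketched as a search over fractional lags around the rough pitch. This is only an illustration under stated assumptions: the name `fine_pitch_search`, the search span of two samples and the quarter-sample step are hypothetical, and `spectral_error` is assumed to return the distance between the synthesized power spectrum and that of the original speech for a candidate pitch.

```python
def fine_pitch_search(rough_lag, spectral_error, span=2.0, step=0.25):
    """Try fractional lags around rough_lag and keep the one with the lowest error.

    spectral_error(lag) stands in for the analysis-by-synthesis criterion;
    span and step are illustrative values, not taken from the specification.
    """
    steps = int(round(2 * span / step))
    candidates = [rough_lag - span + i * step for i in range(steps + 1)]
    return min(candidates, key=spectral_error)


# Toy error surface whose minimum lies at a fractional lag of 40.25 samples.
best = fine_pitch_search(40, lambda lag: (lag - 40.25) ** 2)
```

With the toy error surface the search moves from the rough value 40 to the fractional value 40.25.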
[0051] In the spectrum evaluation unit 148, the magnitude of each harmonic and a spectral envelope as its set are
evaluated, based on the pitch and the spectral amplitude as an orthogonal transform output of the LPC residuals. The
result of the evaluation is sent to the high-precision pitch search unit 146, the V/UV decision unit 115 and the
psychoacoustically weighted vector quantization unit 116.

[0052] In the V/UV decision unit 115, V/UV decision of a frame in question is given based on an output of the
orthogonal conversion circuit 145, an optimum pitch from the high-precision pitch search unit 146, amplitude data from
the spectrum evaluation unit 148, a maximum normalized autocorrelation value r(p) from the open-loop pitch search
unit 141 and the value of zero crossings from the zero-crossing counter 142. The boundary position of the result of the
band-based V/UV decision in case of MBE coding may also be used as a condition of the V/UV decision of the frame
in question. A decision output of the V/UV decision unit 115 is taken out via output terminal 105.
[0053] An output of the spectrum evaluation unit 148 or an input of the vector quantization unit 116 is provided with
a number of data conversion unit 119, which is a sort of a sampling rate conversion unit. This number of data conversion
unit operates for setting the amplitude data |Am| of the envelope to a constant number, in consideration that the number
of bands split on the frequency domain is varied with the pitch and hence the number of data is varied. That is, if the
effective band is up to 3400 Hz, this effective band is split into 8 to 63 bands, depending on the pitch, such that the
number mMX+1 of the amplitude data |Am| obtained from band to band also is varied in a range from 8 to 63. So, the
number of data conversion unit 119 converts this variable number mMX+1 of amplitude data into a constant number M, for
example, 44.
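
The conversion from a pitch-dependent number of amplitudes to a constant number M can be sketched as a resampling of the envelope. The sketch below uses plain linear interpolation purely as a stand-in; the text only says that the unit is a sort of sampling rate conversion unit, so the interpolation method and the name `convert_number_of_data` are assumptions.

```python
def convert_number_of_data(amplitudes, target_count=44):
    """Resample a variable-length amplitude envelope to target_count values.

    Linear interpolation is an assumption of this sketch; the first and
    last amplitudes are preserved.
    """
    n = len(amplitudes)
    if n == 0:
        raise ValueError("at least one amplitude is required")
    if n == 1:
        return [amplitudes[0]] * target_count
    out = []
    for k in range(target_count):
        pos = k * (n - 1) / (target_count - 1)  # position on the input grid
        i = min(int(pos), n - 2)                # left neighbour index
        frac = pos - i
        out.append((1.0 - frac) * amplitudes[i] + frac * amplitudes[i + 1])
    return out


# Envelopes with the smallest and largest band counts named in the text.
few_bands = [float(i) for i in range(8)]
many_bands = [float(i) for i in range(63)]
fixed_a = convert_number_of_data(few_bands)
fixed_b = convert_number_of_data(many_bands)
```

Both envelopes come out with 44 values, so the subsequent vector quantization always sees vectors of one fixed dimension.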

[0054] The above-mentioned constant number M, such as 44, of amplitude data or envelope data from the number of
data conversion unit provided at an output of the spectrum evaluation unit 148 or at an input of the vector quantization
unit 116 are taken together in terms of a pre-set number of data, such as 44 data, as vectors, which are subjected to
weighted vector quantization. This weighting is imparted by an output of the psychoacoustic weighting filter calculating
circuit 139. An index idS of the above-mentioned envelope from the vector quantization unit 116 is outputted at the
output terminal 103 through switch 117. Meanwhile, an inter-frame difference employing an appropriate leakage coefficient
may be taken for a vector made up of a pre-set number of data prior to the weighted vector quantization.
[0055] The second encoding unit 120 is hereinafter explained. The second encoding unit 120 is of the so-called
CELP (code excited linear prediction) configuration and is especially used for encoding the unvoiced portion of the input
speech signal. In this CELP encoding configuration for the unvoiced speech portion, a noise output corresponding to the
LPC residuals of the unvoiced speech as a representative output of the noise codebook, or a so-called stochastic codebook
121, is sent through a gain circuit 126 to the psychoacoustically weighted synthesis filter 122. The weighted synthesis
filter 122 LPC-synthesizes the input noise to send the resulting signal of the weighted unvoiced speech to a subtracter
123. The subtracter is fed with speech signals, supplied from the input terminal 101 via a high-pass filter (HPF) 109 and
psychoacoustically weighted by a psychoacoustic weighting filter 125, to take out a difference or
error from a signal from the synthesis filter 122. It is noted that a zero input response of the psychoacoustically
weighted synthesis filter is to be subtracted at the outset from an output of the psychoacoustic weighting filter 125. This
error is sent to a distance calculating circuit 124 to make distance calculations to search a representative value vector
which minimizes the error by the noise codebook 121. It is the time domain waveform, which employs the closed loop
search employing in turn the analysis by synthesis method, that is vector quantized.
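
The closed-loop codebook search can be sketched as an exhaustive analysis-by-synthesis loop. This is a simplified sketch under stated assumptions, not the circuit of the second encoding unit 120: the helper names, the first-order all-pole filter and the tiny codebooks are hypothetical, and `target` is assumed to be the weighted speech with the zero input response already subtracted.

```python
def all_pole_filter(excitation, coeffs):
    """Synthesis filter y[n] = x[n] - sum_k coeffs[k] * y[n-1-k] (zero initial state)."""
    out = []
    for n, x in enumerate(excitation):
        acc = x
        for k, coef in enumerate(coeffs):
            if n - 1 - k >= 0:
                acc -= coef * out[n - 1 - k]
        out.append(acc)
    return out


def search_noise_codebook(target, shapes, gains, coeffs):
    """Return (shape_index, gain_index, error) minimizing the squared error."""
    best = None
    for shape_index, shape in enumerate(shapes):
        synth = all_pole_filter(shape, coeffs)  # synthesize each candidate once
        for gain_index, gain in enumerate(gains):
            err = sum((t - gain * s) ** 2 for t, s in zip(target, synth))
            if best is None or err < best[2]:
                best = (shape_index, gain_index, err)
    return best


# Hypothetical codebooks and filter, chosen only to exercise the search.
shapes = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [1.0, -1.0, 1.0, -1.0]]
gains = [0.5, 1.0, 2.0]
coeffs = [-0.5]
target = [2.0 * s for s in all_pole_filter(shapes[2], coeffs)]
shape_idx, gain_idx, error = search_noise_codebook(target, shapes, gains, coeffs)
```

The two returned indices play the role of the shape index and the gain index that the encoder outputs for the unvoiced portion.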

[0056] As data for the UV (unvoiced) portion from the second encoding unit 120 employing the CELP encoding
configuration, the shape index idSl of the codebook from the noise codebook 121 and the gain index idGl of the codebook
from a gain circuit 126 are taken out. The shape index idSl, which is the UV data from the noise codebook 121, is sent
through a switch 127s to an output terminal 107s, whilst the gain index idGl, which is the UV data of the gain circuit 126,
is sent via switch 127g to an output terminal 107g.

[0057] These switches 127s, 127g and the above-mentioned switches 117, 118 are on/off controlled based on the
results of V/UV discrimination from the V/UV decision unit 115. The switches 117, 118 are turned on when the results
of V/UV decision of the speech signals of the frame now about to be transmitted indicate voiced sound (V), whilst the
switches 127s, 127g are turned on when the speech signals of the frame now about to be transmitted are unvoiced
sound (UV).

[0058] The parameters outputted by the above-described speech encoder 3, that is LSP parameters LSP,
voiced/unvoiced discriminating parameters VUV, pitch parameters PCH, codebook parameter idS and the gain index
idG of the spectral envelope, noise codebook parameter idSl and the gain index idGl, are classed into 2 kbps/4 kbps coding,
and the numbers of allotted bits are also indicated, as shown in Figs.4 and 5. Fig.4 shows parameters common to 2 kbps
and 4 kbps, whilst Fig.5 shows parameters proper only to 4 kbps. The parameters shown in Figs.4 and 5 are those per
frame.

[0059] The LSP parameters are classed into LSP0, LSP2, LSP3, LSP4 and LSP5. The LSP0 is the codebook index
of the order-ten LSP parameter, and is used as a basic parameter of the envelope. In a frame of 20 msec, 5 bits are
allotted to the LSP0. The LSP2 is a codebook index of an LSP parameter for order-five low frequency range error
correction and has 7 bits allotted thereto. The LSP3 is a codebook index of an LSP parameter for order-five high frequency
range error correction and has 5 bits allotted thereto. The LSP5 is a codebook index of an LSP parameter for order-ten
full frequency range error correction and has 8 bits allotted thereto. Of these, LSP2, LSP3 and LSP5 are indices used
for compensating the error of the previous stage and are used supplementarily when the LSP0 has not been able to
represent the envelope sufficiently. The LSP4 is a 1-bit selection flag for selecting whether the encoding mode at the
time of encoding is the straight mode or the differential mode. Specifically, it indicates the selection of a mode with a
smaller LSP difference between the LSP of the straight mode as found by quantization and the LSP as found by the
quantized difference. If the LSP4 is 0 or 1, the mode is the straight mode or the differential mode, respectively.

[0060] The VUV parameter is a flag indicating whether the encoded speech data in a pre-set frame is voiced or
unvoiced (voiced/unvoiced), and has two bits allotted thereto.

[0061] The PCH parameter is a pitch parameter, and is pitch data from the high-precision pitch search unit 146 by
the closed loop as discussed above. To this pitch parameter are allotted 7 bits.

[0062] If the codebook parameter idS of the spectral envelope is for 2 kbps, the codebook parameter is divided into the
zeroth LPC residual spectral codebook index, represented as idS0, and the first LPC residual spectral codebook index,
represented by idS1. The zeroth and first LPC residual spectral codebook indices are indices associated with the respective
codebooks. The LPC residual spectrum is formed by addition of the two codebook entries selected therefrom. The idG is the
LPC residual spectral gain codebook index and has five bits allotted thereto. If the codebook parameter idS of the
spectral envelope is for 4 kbps, it is divided into the zeroth extension LPC residual spectral codebook index, represented by
idS0_4k, first extension LPC residual spectral codebook index, represented by idS1_4k, second extension LPC residual
spectral codebook index, represented by idS2_4k, and third extension LPC residual spectral codebook index, represented
by idS3_4k. To the idS0_4k, idS1_4k, idS2_4k and idS3_4k, 7, 10, 9 and 6 bits are allotted, respectively.
[0063] It is noted that idS0_4k and so on correct the error components of the quantized LPC residual spectrum
obtained with idS0, idS1 and idG, and the correction range is allocated from the low frequency range to the high
frequency range towards idS3_4k.

[0064] The remaining parameters are used for unvoiced sound. Specifically, the parameters with SL, such as
idSL00 or idSL11, denote the noise codebook indices, whereas those with GL, such as idGL00 or idGL11, indicate the
noise codebook gain codebook indices. To the idSL00, idSL01, idGL00 and idGL01, 6, 6, 4 and 4 bits are allocated,
respectively. 5 bits are allocated to each of the idSL10, idSL11, idSL12 and idSL13, whilst 3 bits are allocated to
each of the idGL10, idGL11, idGL12 and idGL13.
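
As a consistency check, the bit allotments can be summed per 20 msec frame and converted into a bit rate. The sketch below is illustrative only: the table and function names are hypothetical, the four-bit widths of idS0 and idS1 are taken from the later description of the class division rather than from the paragraphs above, and the assumption that voiced 4 kbps frames also carry LSP5 is made here.

```python
FRAME_MS = 20  # frame length stated in the text

# Bits per frame for parameters shared by voiced and unvoiced frames.
COMMON = {"LSP0": 5, "LSP2": 7, "LSP3": 5, "LSP4": 1, "VUV": 2}

# 2 kbps parameter sets (idS0/idS1 widths: four bits each, per the class tables).
VOICED_2K = {**COMMON, "PCH": 7, "idS0": 4, "idS1": 4, "idG": 5}
UNVOICED_2K = {**COMMON, "idSL00": 6, "idSL01": 6, "idGL00": 4, "idGL01": 4}

# Parameters added for 4 kbps (LSP5 in voiced frames is an assumption).
VOICED_4K_EXTRA = {"LSP5": 8, "idS0_4k": 7, "idS1_4k": 10, "idS2_4k": 9, "idS3_4k": 6}
UNVOICED_4K_EXTRA = {
    "LSP5": 8,
    "idSL10": 5, "idSL11": 5, "idSL12": 5, "idSL13": 5,
    "idGL10": 3, "idGL11": 3, "idGL12": 3, "idGL13": 3,
}


def bit_rate_bps(*tables):
    """Sum the bits of all given tables and convert bits per frame to bits per second."""
    bits_per_frame = sum(sum(table.values()) for table in tables)
    return bits_per_frame * 1000 // FRAME_MS


rates = {
    "voiced 2k": bit_rate_bps(VOICED_2K),
    "unvoiced 2k": bit_rate_bps(UNVOICED_2K),
    "voiced 4k": bit_rate_bps(VOICED_2K, VOICED_4K_EXTRA),
    "unvoiced 4k": bit_rate_bps(UNVOICED_2K, UNVOICED_4K_EXTRA),
}
```

Under these assumptions each 2 kbps frame holds 40 bits and each 4 kbps frame holds 80 bits, which matches the nominal rates for a 20 msec frame.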

[0065] The parameters shown in Figs.4 and 5 are divided by the class division and inputting sequence decision unit
23 into plural classes, depending on the psychoacoustic sensitivity of the bits to the transmission path errors.
[0066] If the bit rate is 2 kbps, with the transmission rate being 3.5 kbps, the
class division and inputting sequence decision unit 23 divides the voiced sound and the unvoiced sound into e.g., six
classes, as shown in Figs.6 and 7, in which the suffices "p" and "c" denote the previous and current frames, respectively.
That is, two frames, that is the previous frame p and the current frame c, are considered here. The smaller the class
number, the more crucial is the bit.

[0067] Referring to Fig.6, the case of the voiced speech is explained. With the order-ten codebook index LSP0 of
the LSP parameters, the entire five bits of both the previous frame p and the current frame c are of the class I. In both
the previous frame p and the current frame c of the codebook index LSP2 of the order-five low frequency range error
correction of the LSP parameter, two bits are of the class I, with the remaining five bits being of the class VI. With the
codebook index LSP3 of the order-five high frequency range error correction of the LSP parameters, one of the five bits
of both the previous frame p and the current frame c is of the class I, with the remaining four bits being of the class VI.
With the straight mode/differential mode selection flag LSP4 of the LSP parameter, both the previous frame p and the
current frame c are protected by the class I.

[0068] The two bits of the voiced/unvoiced flag VUV are protected by the class I in both the previous frame p and the
current frame c. As for the pitch parameter PCH, 6 of seven bits are of the class I for both the previous frame p and the
current frame c, with the remaining one bit being of the class VI.

[0069] The 5 bits of the LPC residual spectral gain codebook index idG are protected as being of the class I for both
the previous frame p and the current frame c. The four bits of the zeroth LPC residual spectral codebook index idS0 of
the previous frame p are all of the class II, whereas the four bits of the zeroth LPC residual spectral codebook index idS0
of the current frame c are all of the class IV. On the other hand, the four bits of the first LPC residual spectral codebook
index idS1 of the previous frame p are all of the class III, whereas the four bits of the first LPC residual spectral codebook
index idS1 of the current frame c are all of the class V.
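
The class division for voiced sound at 2 kbps can be tabulated and totalled per class. This is a sketch for checking the figures given for Fig.6; the data layout and the names `voiced_2k_classes` and `bits_per_class` are assumptions of this example, not part of the described unit 23.

```python
from collections import Counter


def voiced_2k_classes():
    """Map (parameter, frame) to {class: number of bits}; "p" = previous, "c" = current."""
    table = {}
    for frame in ("p", "c"):
        table[("LSP0", frame)] = {"I": 5}
        table[("LSP2", frame)] = {"I": 2, "VI": 5}
        table[("LSP3", frame)] = {"I": 1, "VI": 4}
        table[("LSP4", frame)] = {"I": 1}
        table[("VUV", frame)] = {"I": 2}
        table[("PCH", frame)] = {"I": 6, "VI": 1}
        table[("idG", frame)] = {"I": 5}
    # The spectral codebook indices fall into different classes per frame.
    table[("idS0", "p")] = {"II": 4}
    table[("idS1", "p")] = {"III": 4}
    table[("idS0", "c")] = {"IV": 4}
    table[("idS1", "c")] = {"V": 4}
    return table


def bits_per_class(table):
    """Total the number of bits assigned to each class over both frames."""
    totals = Counter()
    for split in table.values():
        totals.update(split)  # adds the per-class bit counts
    return dict(totals)


totals = bits_per_class(voiced_2k_classes())
```

The totals come to 44 bits in class I, four bits in each of classes II to V and 20 bits in class VI, that is 80 bits for the two frames.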

[0070] Referring to Fig.7, the case of the bit rate of 2 kbps, with the transmission rate being 3.5 kbps and with the sound
being unvoiced, is hereinafter explained.

[0071] In the codebook index LSP2 for correcting the order-five low frequency range error of the LSP parameter, 4
of seven bits of both of the previous frame p and the current frame c are of class I, with the remaining three bits being
of the class VI. In the codebook index LSP3 for correcting the order-five high frequency range error of the LSP parameter,
two of five bits of both of the previous frame p and the current frame c are of class I, with the remaining three bits
being of the class VI. The straight mode/differential mode selection flag is protected with the class I for both of the
previous frame p and the current frame c.

[0072] The two bits of the voiced/unvoiced sound flag VUV are protected with the class I for both of the previous
frame p and the current frame c.

[0073] The four bits of the noise codebook gain codebook index idGL00 are protected with the class I for both of the
previous frame p and the current frame c. The four bits of the noise codebook gain codebook index idGL01 are protected
with the class I for both of the previous frame p and the current frame c. The six bits of the noise codebook index
idSL00 are set to the class VI for both of the previous frame p and the current frame c. The six bits of the
noise codebook index idSL01 are set to the class VI for both of the previous frame p and the current frame c.
[0074] In this unvoiced speech, part of the bits of the class VI are protected as bits of the classes from II to V. However,
if an error is detected, no measures are taken, as in the case of the bits of the other class VI.

[0075] If the bit rate is 4 kbps (transmission rate, 6.2 kbps), the class division and inputting sequence decision unit
23 divides the voiced sound and the unvoiced sound into e.g., seven classes, as shown in Figs.8 and 9. In these figures,
suffices "p" and "c" again denote the previous and current frames, respectively. That is, two frames, that is the previous
frame p and the current frame c, are considered here. Here, again, the smaller the class number, the more crucial is the
bit.

[0076] The case in which the sound is voiced is explained with reference to Fig.8. In the order-ten codebook index
LSP0 of the LSP parameter, five bits of both the previous frame p and the current frame c are of the class I. In the codebook
index LSP2 for correcting the order-five low frequency range error of the LSP parameter, four of seven bits of the
previous frame p and the current frame c are of the class I, with the remaining three bits being of the class VII. In the
codebook index LSP3 for correcting the order-five high frequency range error of the LSP parameter, one of five bits of
each of the previous frame p and the current frame c is of the class I, with the remaining four bits being of the class VII. The
straight mode/differential mode selection flag LSP4 of the LSP parameter is protected with the class I for both of the
previous frame p and the current frame c.

[0077] The two bits of the voiced/unvoiced sound flag VUV are protected with the class I for both of the previous
frame p and the current frame c. On the other hand, six of seven bits of the pitch parameter PCH for both the previous
frame p and the current frame c are set to the class I, with the remaining one bit being of the class VII.

[0078] In addition, five bits of the LPC residual spectral gain codebook index idG are set to the class I and
protected. On the other hand, all of the four bits of the zeroth LPC residual spectral codebook index idS0 of the previous
frame p are set to the class III, whilst all of the four bits of the zeroth LPC residual spectral codebook index idS0 of the
current frame c are set to the class V. The four bits of the first LPC residual spectral codebook index idS1 of the previous
frame p are all set to the class IV, whereas the four bits of the first LPC residual spectral codebook index idS1 of
the current frame c are all set to the class VI.

[0079] On the other hand, five of seven bits of the zeroth extended LPC residual spectral codebook index idS0_4k
for both the previous frame p and the current frame c are set to the class I, whereas the remaining two bits are set to the
class VII. One of ten bits of the first extended LPC residual spectral codebook index idS1_4k for both the previous
frame p and the current frame c is set to the class I, whereas the remaining nine bits are set to the class II. One of nine
bits of the second extended LPC residual spectral codebook index idS2_4k for both the previous frame p and the current
frame c is set to the class I, whilst the remaining eight bits are set to the class II. Moreover, one of six bits of the third
extended LPC residual spectral codebook index idS3_4k for both the previous frame p and the current frame c is set to the
class I, whilst the remaining five bits are set to the class II.

[0080] Referring to Fig.9, the case in which the sound is unvoiced, with the bit rate being 4 kbps (transmission
rate, 6.2 kbps), is explained. In the order-ten codebook index LSP0 of the LSP parameters, all of five bits of the previous
frame p and the current frame c are set to the class I.

[0081] In the codebook index LSP2 for the order-five low frequency range error correction of the LSP parameters, four of
seven bits of both the previous frame p and the current frame c are set to the class I, with the remaining three
bits being of the class VII. In the codebook index LSP3 of the order-five high frequency range error correction of the
LSP parameters, one of five bits of both the previous frame p and the current frame c is set to the class I, with the
remaining four bits being of the class VII. The straight mode/differential mode selection flag LSP4 of the LSP parameters
is protected with the class I for both the previous frame p and the current frame c. In the codebook index
LSP5 of the order-ten full range error correction, one of eight bits of both the previous frame p and the current frame c
is set to the class I, with the remaining seven bits being of the class VII.

[0082] The respective two bits of the voiced/unvoiced sound flag VUV are protected for both the previous frame p
and the current frame c with the class I.

[0083] The respective two bits of the voiced/unvoiced sound flag VUV are set for both the previous frame p and the
current frame c to the class I and protected. The four bits of the noise codebook gain codebook index idGL01 are all set
to the class I for both the previous frame p and the current frame c and protected.
[0084] On the other hand, six bits of the noise codebook index idSL00 are set for both the previous frame p and the
current frame c to the class VII, whilst six bits of the noise codebook index idSL01 are set
to class VII for both the previous frame p and the current frame c.

[0085] Three bits of the noise codebook gain codebook index idGL10 are set for both the previous frame p and the
current frame c to class I and protected. Also, three bits of the noise codebook gain codebook index idGL11 are protected
as class I for both the previous frame p and the current frame c. Three bits of the noise codebook gain codebook index
idGL12 are set for both the previous frame p and the current frame c to class I. Two of three bits of the noise codebook
gain codebook index idGL13 are set to the class I for both the previous frame p and the current frame c, whereas the
remaining one bit is set to the class VII.

[0086] All of five bits of the noise codebook index idSL10 are set to class VII for both the previous frame p and the
current frame c. Five bits of the noise codebook index idSL11 are all set to the class VII for both the previous frame p
and the current frame c, whilst five bits of the noise codebook index idSL13 are set to the class VII for both the previous
frame p and the current frame c.

[0087] Here again, part of the bits of the class VII are protected as bits of the class II to class VI. If the bits are in error,
no measures are taken, such that the bits in error are handled in the same way as bits of the other class VII.

[0088] The inputting sequence of respective parameters of the voiced and unvoiced sound of 2 kbps from the class
I to the class VI to the transmission path encoder (channel coder) 4 is explained with reference to Figs.10 and 11. The
inputting sequence is again determined by the class division and inputting sequence decision unit 23, as discussed
above. The bit arraying sequence is responsive to the psychoacoustic sensitivity of the bits to the transmission path
errors. In these figures, suffices "p" and "c" again denote the previous and current frames, respectively, whilst the bit 0
denotes the LSB.

[0089] In Fig.10, the inputting sequence of the class I of the voiced sound is the first bit of the voiced/unvoiced
sound flag VUV of the previous frame p, followed by the zeroth bit of the voiced/unvoiced sound flag VUV of the previous
frame p, followed by the zeroth bit of the straight mode/differential mode selection flag LSP4 of the previous frame p,
followed by the fourth, third, second, first and zeroth bits of the LPC residual spectral gain codebook index idG of the
previous frame p, the fourth, third, second, first and the zeroth bits of the order-ten codebook index LSP0 of the LSP
parameter of the previous frame p, followed by the sixth, fifth, fourth, third, second and first bits of the pitch parameter
PCH of the previous frame p, in this order. The inputting sequence up to this point is from No.0 to No.18 to the
transmission path encoder 4. From No.19 on, the inputting sequence continues with the sixth bit of the
codebook index LSP2 of the order-five low frequency range error correction of the LSP parameter of
the previous frame p, as No.19, the fourth bit of the codebook index LSP3 of the order-five high frequency range
error correction of the LSP parameter of the previous frame p, as No.20, and the fifth bit of the codebook index LSP2 of
the LSP parameter of the previous frame p, as No.21. The inputting sequence is further continued by the parameters of the
current frame c, in such a manner that the sequence of No.0 to No.21 is repeated as No.22 to No.43.
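
The class I ordering for voiced sound at 2 kbps can be written out as an explicit list, which makes the numbering easy to check. This is a sketch only: the tuple layout `(parameter, bit, frame)` and the function name are assumptions, and the pitch parameter contributes its six upper bits here because only six of its seven bits belong to class I, which is what makes the numbering No.0 to No.18 come out.

```python
def class1_sequence_voiced_2k():
    """Return the class I inputting sequence as (parameter, bit, frame) tuples.

    Bit 0 is the LSB; frame "p" is the previous frame and "c" the current frame.
    """
    per_frame = (
        [("VUV", b) for b in (1, 0)]
        + [("LSP4", 0)]
        + [("idG", b) for b in (4, 3, 2, 1, 0)]
        + [("LSP0", b) for b in (4, 3, 2, 1, 0)]
        + [("PCH", b) for b in (6, 5, 4, 3, 2, 1)]
        + [("LSP2", 6), ("LSP3", 4), ("LSP2", 5)]
    )
    # The same 22-bit pattern is sent first for frame p, then for frame c.
    return [(name, bit, frame) for frame in ("p", "c") for name, bit in per_frame]


sequence = class1_sequence_voiced_2k()
for number in (0, 18, 19, 21, 22, 43):
    print(f"No.{number}: {sequence[number]}")
```

The list has 44 entries, with the previous frame occupying No.0 to No.21 and the current frame No.22 to No.43.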
[0090] The inputting sequence of the class II bits of the voiced sound (Nos.44 to 47) is the four bits from the third
to zeroth bits of the zeroth LPC residual spectral codebook index idS0 of the
previous frame p, determined as shown in Fig.10.

[0091] The inputting sequence of the class III bits of the voiced sound (Nos.48 to 51) is the four bits from the third
to zeroth bits of the first LPC residual spectral codebook index idS1 of the
previous frame p, determined as shown in Fig.10.

[0092] The inputting sequence of the class IV bits of the voiced sound (Nos.52 to 55) is the four bits from the third
to zeroth bits of the zeroth LPC residual spectral codebook index idS0 of the
current frame c, determined as shown in Fig.10.

[0093] The inputting sequence of the class V bits of the voiced sound (Nos.56 to 59) is the four bits from the third
to zeroth bits of the first LPC residual spectral codebook index idS1 of the
current frame c, determined as shown in Fig.10.

[0094] The inputting sequence of the class VI bits of the voiced sound (from No.60 to No.79) is the fourth, third,
second, first and the zeroth bits of the codebook index LSP2 of the order-five low frequency range error correction of
the LSP parameter of the previous frame p, in this order. The inputting sequence up to this point is that for No.60 to
No.64 to the transmission path encoder 4. The inputting sequence is continued by the third, second, first and zeroth bits
of the codebook index LSP3 of the order-five high frequency range error correction of the LSP parameter of the
previous frame p, as No.65 to No.68, in this order. The No.69 bit is the zeroth bit of the pitch parameter PCH of the
previous frame p. The inputting sequence is further continued by the fourth, third, second, first and zeroth bits of the
codebook index LSP2 of the order-five low frequency range error correction of the LSP parameter of the current
frame c, as No.70 to No.74, in this order. The inputting sequence is further continued by the third, second, first
and zeroth bits of the codebook index LSP3 of the order-five high frequency range error correction of the LSP
parameter of the current frame c, as No.75 to No.78, in this order. The last No.79 bit is the zeroth bit of the pitch
parameter PCH of the current frame c.

[0095] Referring to Fig.11, the inputting sequence of the class I of the unvoiced sound is the first bit and the zeroth
bit of the voiced/unvoiced sound flag VUV of the previous frame p, followed by the zeroth bit of the straight mode/differential
mode selection flag LSP4 of the previous frame p, the third, second, first and the zeroth bits of the noise codebook
gain codebook index idGL00 of the previous frame p, the third, second, first and the zeroth bits of the noise codebook
gain codebook index idGL01 of the previous frame p, and by the fourth, third, second, first and the zeroth bits of the
order-ten codebook index LSP0 of the LSP parameter of the previous frame p, in this order. The inputting sequence up to this
point is that from No.0 to No.15 to the transmission path encoder 4. The bits of No.16 to No.19 are the sixth, fifth, fourth
and the third bits of the codebook index LSP2 of the order-five low frequency range error correction of the LSP
parameters of the previous frame p. The bits of No.20 and No.21 are the third and fourth bits of the codebook index LSP3 of
the order-five high frequency range error correction of the LSP parameters of the previous frame p. In No.22 to No.43,
the parameters of the current frame are inputted as repeating the above-mentioned sequence from Nos.0 to 21.
[0096] The inputting sequence of the class II of the unvoiced sound is the three bits from the second to the zeroth
bits of the codebook index LSP2 of the order-five low frequency range error correction of the LSP
parameters of the previous frame p and the second bit of the codebook index LSP3 of the order-five high frequency
range error correction of the LSP parameters of the previous frame p (from No.44 to No.47).

[0097] The inputting sequence of the class III of the unvoiced sound is the first bit (No.48) and zeroth bit (No.49) of the
codebook index LSP3 of the order-five high frequency range error correction of the LSP parameters of the previous
frame p, and the fifth bit (No.50) and the fourth bit (No.51) of the noise codebook index idSL00 of the
previous frame p.

[0098] The inputting sequence of the class IV of the unvoiced sound is the second to zeroth bits of the codebook index
LSP2 of the order-five low frequency range error correction of the LSP parameters of the current frame c, totalling three
bits, and the second bit of the codebook index LSP3 of the order-five high frequency range error correction (Nos.52 to 55).

[0099] The inputting sequence of the class V of the unvoiced sound is the first bit (No.56) and the zeroth bit (No.57)
of the codebook index LSP3 of the order-five high frequency range error correction of the LSP parameters of the current
frame c. The No.58 and No.59 are the fifth and fourth bits of the noise codebook index idSL00 of the current frame c.
[0100] The inputting sequence of the class VI bits of the unvoiced sound (from No.60 to No.79) is the third, second,
first and the zeroth bits of the noise codebook index idSL00 of the previous frame p. The inputting sequence up to this point
is that for No.60 to No.63 to the transmission path encoder 4. In the inputting sequence, No.64 to No.69 are the fifth,
fourth, third, second, first and the zeroth bits of the noise codebook index idSL01 of the previous frame p. The inputting
sequence is continued by the third, second, first and the zeroth bits of the noise codebook index idSL00 of the current frame
c, as No.70 to No.73. The inputting sequence is further continued by the fifth, fourth, third, second, first and the zeroth
bits of the noise codebook index idSL01 of the current frame c, as No.74 to No.79.

[0101] The inputting sequence to the transmission path encoder 4 (channel encoder) of the respective parameters
of the voiced and unvoiced sounds from class I to class VII of 4 kbps is shown in Figs.12 to 14 and in Figs.15 to 17.
The inputting sequence here is determined by the class division and inputting sequence decision unit 23, as described
above. The bit arraying sequence is responsive to the psychoacoustic sensitivity of the bits to the transmission path
errors. In these figures, suffixes "p" and "c" again denote the previous and current frames, respectively, whilst bit 0
denotes the LSB.

[0102] Fig.12 shows the inputting sequence of the class I of the voiced sound. The inputting sequence of the sum
total of 66 bits from No.0 to No.65 is set. This inputting sequence is the first bit of the voiced/unvoiced sound flag VUV
of the previous frame p, followed by the zeroth bit of the voiced/unvoiced sound flag VUV of the previous frame p, the zeroth
bit of the straight mode/differential mode selection flag LSP4 of the LSP parameter of the previous frame p, and by






the fourth, third, second, first and the zeroth bits of the LPC residual spectral gain codebook index idG of the previous
frame p, in this order. The inputting sequence is continued by the fourth, third, second, first and the zeroth bits of the
order-ten codebook index LSP0 of the LSP parameter of the previous frame p. The inputting sequence is further continued
by the sixth, fifth, fourth, third, second and first bits of the pitch parameter PCH of the previous frame p. The inputting
sequence up to this point is that from No.0 to No.18 to the transmission path encoder 4. From No.19 on, the inputting
sequence is continued by the sixth (No.19), fifth (No.20), fourth (No.21) and third (No.22) bits of the codebook index LSP2 of
correction of the low frequency range error of the LSP parameter of the previous frame. The inputting sequence is further
continued by the sixth, fifth, fourth and the third bits of the zeroth extended LPC residual spectral codebook index
idS0_4k of the previous frame p as No.23 to No.27. For No.28 and No.29, the fourth bit of the codebook index LSP3
of the correction of the order-five high frequency range error of the previous frame p and the seventh bit of the
codebook index LSP5 of the LSP parameter of the order-ten full range error correction of the previous
frame p are arrayed. For No.30 to No.32, the ninth bit of the first extended LPC residual spectral codebook index
idS1_4k, the eighth bit of the second extended LPC residual spectral codebook index idS2_4k and the fifth bit of the third
extended LPC residual spectral codebook index idS3_4k are arrayed. For No.33 to No.65, the parameters of the current frame
c are inputted in a manner of repeating the sequence from No.0 to No.32.

[0103] The inputting sequence from class II to class V of the voiced speech is shown in Fig.13. A sum total of
44 bits of the class II are inputted in the sequence from No.66 to No.109. For No.66 to No.74, the eighth, seventh,
sixth, fifth, fourth, third, second, first and the zeroth bits of the first extended LPC residual spectral codebook index
idS1_4k of the previous frame are arrayed. For No.75 to No.82, the seventh, sixth, fifth, fourth, third, second, first and
the zeroth bits of the second extended LPC residual spectral codebook index idS2_4k of the previous frame are
arrayed. For No.83 to No.87, the fourth, third, second, first and the zeroth bits of the third extended LPC residual
spectral codebook index idS3_4k of the previous frame are arrayed. For No.88 to No.109, bits relevant to the current frame
c of a sum total of 22 bits from No.66 to No.87 are repeated.

[0104] For four bits of the class III, the third, second, first and the zeroth bits of the zeroth LPC residual spectral
codebook index idS0 of the previous frame p are arrayed as No.110 to No.113.

[0105] For four bits of the class IV, the third, second, first and the zeroth bits of the first LPC residual spectral
codebook index idS1 of the previous frame p are arrayed as No.114 to No.117.

[0106] For four bits of the class V, the third, second, first and the zeroth bits of the zeroth LPC residual spectral
codebook index idS0 of the current frame c are arrayed as No.118 to No.121.

[0107] The inputting sequence of the classes VI and VII of the voiced sound is shown in Fig.14. A sum total of four
bits of the class VI for No.122 to No.125 are arrayed in the sequence of the third, second, first and the zeroth bits of the
first LPC residual spectral codebook index idS1 of the current frame c.

[0108] A sum total of 34 bits of the class VII are allotted in No.126 to No.159 as follows. For No.126 to No.128,
the second, first and the zeroth bits of the codebook index LSP2 of the order-five low frequency range error correction
of the previous frame p are arrayed. For No.129 to No.132, the third, second, first and the zeroth bits of the codebook
index LSP3 of the order-five low frequency range error correction of the previous frame p are arrayed. For No.133 to
No.139 are arrayed the sixth, fifth, fourth, third, second, first and the zeroth bits of the codebook index LSP5 of the LSP
parameters of the order-ten full-band error correction. For No.140 is allotted the zeroth bit of the pitch parameter PCH of the
previous frame. For No.141 and No.142, the first and zeroth bits of the zeroth extension LPC residual spectrum
codebook index idS0_4k are allotted. For No.143 to No.159 are repeated bits relevant to the current frame c of a sum total
of 17 bits from No.126 to No.142.
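The class boundaries quoted for the 4 kbps voiced frame pair can be cross-checked from the class sizes alone. The following is a minimal sketch, assuming only the bit counts stated in the text (66, 44, 4, 4, 4, 4 and 34 bits for classes I to VII); the names `CLASS_SIZES_4K_VOICED` and `class_ranges` are illustrative and do not appear in the specification.

```python
from itertools import accumulate

# Class sizes in bits for the 4 kbps voiced frame pair, as stated in the text.
CLASS_SIZES_4K_VOICED = {
    "I": 66, "II": 44, "III": 4, "IV": 4, "V": 4, "VI": 4, "VII": 34,
}


def class_ranges(sizes):
    """Map each class name to its inclusive (first, last) input bit numbers."""
    ends = list(accumulate(sizes.values()))   # running totals: 66, 110, ...
    starts = [0] + ends[:-1]                  # first bit number of each class
    return {name: (s, e - 1) for name, s, e in zip(sizes, starts, ends)}


RANGES = class_ranges(CLASS_SIZES_4K_VOICED)
for name, (first, last) in RANGES.items():
    print(f"class {name}: No.{first} to No.{last}")
```

The computed ranges agree with the numbering used above, for instance No.66 to No.109 for class II and No.126 to No.159 for class VII, for 160 bits in total.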

[0109] The inputting sequence of the class I of the unvoiced sound is shown in Fig.15. The inputting
sequence of a sum total of 66 bits from No.0 to No.65 is determined. The inputting sequence is the first bit
of the voiced/unvoiced sound flag VUV of the previous frame p, followed by the zeroth bit of the voiced/unvoiced
sound flag VUV of the previous frame p, the zeroth bit of the straight mode/differential mode selection flag
LSP4 of the LSP parameter of the previous frame p, the third, second, first and zeroth bits of the noise codebook
gain index idGL00 of the previous frame p, the third, second, first and zeroth bits of the noise codebook
gain index idGL01 of the previous frame p, and the fourth, third, second, first and the zeroth bits of the
order-ten codebook index LSP0 of the LSP parameter of the previous frame p. The inputting sequence up to this point is that
from No.0 to No.15 to the transmission path encoder 4. The bits No.16 to No.19 are the sixth, fifth, fourth and third bits
of the codebook index LSP2 of the correction of the low frequency range error of the LSP parameters of the previous
frame p. For bits No.20 and No.21, the fourth bit of the order-five high frequency range error correction of the previous
frame p and the seventh bit of the codebook index LSP5 of the LSP parameter of correction of the order-ten full range
error of the previous frame p are arrayed. For the bits No.22 to No.24, the second, first and zeroth bits of the noise
codebook gain index idGL10 of the previous frame p are arrayed. For the bits No.25
to No.27, the second, first and zeroth bits of the noise codebook gain index idGL11 of the previous frame p
are arrayed. For the bits No.28 to No.30, the second, first and zeroth bits of the noise codebook gain index
idGL12 of the previous frame p are arrayed.

[0110] For the bits Nos.31 and 32, the second and first bits of the noise codebook gain index idGL13 of
the previous frame p are arrayed. For the bits No.33 to 65, the parameters of the current frame c are inputted to repeat
the sequence of the bits Nos.0 to 32.

[0111] The inputting sequence of the classes II and III of the unvoiced sound is shown in Fig.16. Although not shown
in Fig.9, part of the class VII is protected with the CRC as bits of the classes II to VI; however, if an error occurs, no
measures are taken and the bits are handled similarly to the bits of the class VII. First, as the bit No.66, the zeroth bit of the
noise codebook gain index idGL13 of the previous frame p is entered. For the bits Nos.67 to 69, the second,
first and the zeroth bits of the codebook index LSP2 of correction of the order-five low frequency range error of the
previous frame p are arrayed. For the bits No.70 to No.73, the third, second, first and the zeroth bits of the codebook index
LSP3 of correction of the order-five high frequency range error of the previous frame p are arrayed. For the bits No.74
to No.80, the sixth, fifth, fourth, third, second, first and the zeroth bits of the codebook index LSP5 of correction of the
full frequency range error of the previous frame p are arrayed. For the bits No.81 to No.86, the sixth, fifth, fourth, third,
second, first and the zeroth bits of the noise codebook gain index idGL00 of the previous frame p are arrayed.
For the bit No.87, the fifth bit of the noise codebook gain index idGL01 of the previous frame p is arrayed.
For the bits No.88 to 109, the parameters of the current frame c are inputted to repeat the sequence of the bits Nos.66
to 87.

[0112] As the sum total of four bits of the class III of the unvoiced sound, the fourth, third, second and the first bits
of the noise codebook gain index idGL01 of the previous frame p are arrayed for the bits No.110 to No.113.

[0113] The inputting sequence of a sum total of 46 bits from class IV to class VII of the unvoiced sound is shown in
Fig.17. Although not shown in Fig.9, part of the class VII is protected with the CRC as bits of the classes II to VI;
however, if an error occurs, no measures are taken and the bits are handled similarly to the bits of the class VII.
[0114] First, as bit No.114 of the class IV, the zeroth bit of the noise codebook gain index idGL01 of the
previous frame p is arrayed. For the bits Nos.115 to 117, the fourth, third and the second
bits of the noise codebook gain index idGL01 of the previous frame p are arrayed.

[0115] For a sum total of four bits of the class V, the fourth, third, second and the first bits of the noise codebook
index idSL10 of the previous frame p are arrayed as the bits No.118 to No.121.
[0116] As the bit No.122 of the class VI, the zeroth bit of the noise codebook index idSL10 of the
current frame c is arrayed. For the bits No.123 to No.125, the fourth, third and second bits
of the noise codebook index idSL10 of the current frame are arrayed.

[0117] A sum total of 34 bits of the class VII is allotted as Nos.126 to 159 as follows. For the bits No.128 to No.132,
the fourth, third, second, first and the zeroth bits of the noise codebook index idSL10 of the previous
frame p are arrayed. For the bits No.133 to No.137, the fourth, third, second, first and the zeroth bits of the noise
codebook index idSL12 of the previous frame p are arrayed. For the bits No.138 to No.142, the fourth, third,
second, first and the zeroth bits of the noise codebook index idSL13 of the previous frame p are arrayed.
For the bits No.143 to 159, the bits relevant to the current frame c of 17 bits from No.126 to No.142 are repeated.
[0118] The encoding parameters of the voiced and unvoiced sounds with the bitrate of 2 kbps are inputted in the
inputting sequence shown in Figs.10 and 11 to the transmission path encoder 4 shown in Fig.1. On the other hand,
the encoding parameters of the voiced and unvoiced sounds with the bitrate of 4 kbps are inputted in the inputting
sequence shown in Figs.12 to 17 to the transmission path encoder 4 of Fig.1.

[0119] The CRC code calculating unit 5 of the transmission path encoder 4 then finds the CRC code CRC[i], if
necessary, by a CRC polynomial shown in the following equation (1):

R(x) = \sum_{i=0}^{5} CRC[i] \cdot x^{i}    (1)

for class I, depending on the difference in the bitrate of 2 kbps/4 kbps and on the classification. In the class II and so
on, CRC codes are similarly obtained using appropriate CRC polynomials.

[0120] In the above equation,

x^{6} \sum_{i=0}^{N} P[i] \cdot x^{i} = Q(x) \cdot G_{CRC}(x) + R(x)    (2)

and



[0121] The CRC code calculating unit 5 acquires the CRC code CRC[i], using the above equations (1) to (3), only
for the classes I, II, IV and V for 2 kbps. Although both the voiced and unvoiced sounds may be processed, it is mainly
contemplated in the present invention to adjust the output in the decoding device in keeping with an error detected
depending on the CRC code applied to the relevant classes of the voiced sound excluding the class I.
[0122] The total number of bits from the class I to the class VI of the voiced speech of 2 kbps is shown in Fig.18,
in which the total number of bits from class I to class VII of the speech sound of 4 kbps is also shown.

[0123] With the class I of the voiced sound with 2 kbps as an object, the CRC code calculating unit 5 calculates the
6-bit CRC code CRC[i] (CRC parity) from 44 bits of the audio frame. When the class II is an object, the CRC code calculating
unit 5 calculates the 1-bit CRC parity from 4 bits of the audio frame. If the class VI is an object, CRC parity is not
calculated.

[0124] Using the CRC code CRC[i], obtained by the equations (1) to (3), and the inputting sequence P[i], shown in
Fig.10, the bit string CVin[i] is formed, as shown by the equation (4):

CVin[i] = \begin{cases} P[i] & 0 \le i \le N \\ CRC[i-N] & N+1 \le i \le N+P \end{cases}    (4)
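The CRC parity computation and the formation of the protected bit string can be illustrated by a short sketch of polynomial division over GF(2). It is only a sketch under stated assumptions: the generator polynomial `GENERATOR` (x^6 + x + 1) is a placeholder chosen for illustration and is not the polynomial of equation (3), the bit ordering (highest-order coefficient first) is likewise assumed, and the function names are hypothetical.

```python
# Placeholder degree-6 generator x^6 + x + 1 (ASSUMPTION, not the patent's G_CRC).
GENERATOR = [1, 0, 0, 0, 0, 1, 1]


def crc_remainder(bits, generator=GENERATOR):
    """Remainder of x^r * M(x) divided by G(x) over GF(2).

    bits: message bits, highest-order coefficient first (assumed ordering).
    generator: coefficients of G(x), highest-order first, length r + 1.
    """
    r = len(generator) - 1
    work = list(bits) + [0] * r          # multiply the message by x^r
    for i in range(len(bits)):
        if work[i]:                      # cancel the leading term with G(x)
            for j, g in enumerate(generator):
                work[i + j] ^= g
    return work[-r:]                     # the last r bits hold the remainder


def append_crc(bits, generator=GENERATOR):
    """Form the protected bit string: audio bits followed by CRC parity."""
    return list(bits) + crc_remainder(bits, generator)


def crc_matches(received, generator=GENERATOR):
    """Decoder-side check: recompute the parity and compare it."""
    r = len(generator) - 1
    payload, parity = received[:-r], received[-r:]
    return crc_remainder(payload, generator) == parity


# 44 class I bits (arbitrary test pattern) give a 50-bit protected string.
frame = [1 if i % 3 == 0 else 0 for i in range(44)]
cv_in = append_crc(frame)
corrupted = list(cv_in)
corrupted[10] ^= 1                       # simulate one transmission error
print(len(cv_in), crc_matches(cv_in), crc_matches(corrupted))
```

With 44 audio bits and a 6-bit parity the string has 50 bits, which is the quantity that is doubled by the convolution coding for class I at 2 kbps.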

[0125] With the class I of the voiced sound of 4 kbps as an object, the CRC code calculating unit 5 calculates 6-bit
CRC parity from 66 bits of the audio frame. With the class II as an object, the CRC code calculating unit 5 calculates
6-bit CRC parity from 44 bits of the audio frame. With the classes III to VI as an object, the CRC code calculating unit 5
calculates 1-bit CRC parity from 4 bits of the audio frame. If the class VII is an object, the CRC parity is not calculated.
[0126] Using the CRC code CRC[i], as found by the equations (1) to (3), and the inputting sequence P[i], shown in
Figs.12 to 14, the bit string CVin[i] is formed, as shown by the equation (4).

[0127] The convolution encoder 6 applies convolution coding to the bit set to which the CRC code sent from the
CRC code calculating unit 5 is applied as necessary, that is the bit string CVin[i]. In this encoding device, convolution
coding is applied only to the bit string CVin[i] consistent with the class I bits of the bitrates of 2 and 4 kbps for protection.
[0128] The convolution encoder 6 applies convolution coding not in need of tail bits, such as SRCPC (Systematic
Rate Compatible Punctured Convolutional Code) used e.g. in H.223 Annex C, to the bit string CVin[i] consistent with
the class I bits of the bitrates of 2 and 4 kbps. The convolution coding in this convolution encoder 6 is performed at a
double code rate, shown as 8/16 in Fig.18. Thus, the total number of bits of 2 kbps class I is (audio frame 44 bits + CRC
parity 6 bits) x 2 = 100 bits, whilst the total number of bits of 4 kbps class I is (audio frame 66 bits + CRC parity 6 bits)
x 2 = 144 bits.

[0129] For class II to class V of 2 kbps, only the CRC code is applied, without performing the convolution coding.
Therefore, the total number of bits of each of these classes is (audio frame 4 bits + CRC parity 1 bit) x 1 = 5 bits. On the
other hand, since no CRC code is generated in the class VI, only 20 bits of the audio frame is the total number of bits.
[0130] Since no convolution coding is used for class II to class VI of 4 kbps, the total number of bits of class II is
(audio frame 44 bits + CRC parity 6 bits) x 1 = 50 bits, as shown in Fig.18, with the number of bits of each of class III to
class VI being (audio frame 4 bits + CRC parity 1 bit) x 1 = 5 bits. For class VII, in which no CRC code is generated, only 34
bits of the audio frame gives the total number of bits.

[0131] Therefore, with the 2 kbps source coder, in which the total of the entire classes is 140 bits, the bitrate is 3.5
kbps.

[0132] With the 4 kbps source coder, in which the total of the entire classes is 248 bits, the bitrate is 6.2 kbps.
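The totals of 140 and 248 bits follow from the per-class figures given above, as the following sketch of the arithmetic shows. The 40 ms transmission period is an assumption inferred from the stated figures (140 bits corresponding to 3.5 kbps); the helper names are illustrative only.

```python
FRAME_PERIOD_MS = 40  # ASSUMPTION: inferred from 140 bits <-> 3.5 kbps


def class_total(audio_bits, crc_bits, rate_factor=1):
    """Channel bits of one class: (audio + CRC parity) x code-rate factor."""
    return (audio_bits + crc_bits) * rate_factor


# 2 kbps: class I (convolution coded, x2), classes II-V, class VI (no CRC).
budget_2k = [class_total(44, 6, 2)] + [class_total(4, 1)] * 4 + [class_total(20, 0)]

# 4 kbps: class I (x2), class II, classes III-VI, class VII (no CRC).
budget_4k = ([class_total(66, 6, 2), class_total(44, 6)]
             + [class_total(4, 1)] * 4 + [class_total(34, 0)])


def bitrate_kbps(total_bits):
    """Bits per millisecond equals kilobits per second."""
    return total_bits / FRAME_PERIOD_MS


print(sum(budget_2k), bitrate_kbps(sum(budget_2k)))
print(sum(budget_4k), bitrate_kbps(sum(budget_4k)))
```

Only class I carries the factor of two from the convolution coding; every other class contributes its audio bits plus CRC parity unchanged.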

[0133] The plural sorts of the encoding parameters, classed by the class division and inputting sequence decision
unit 23 depending on the psychoacoustic sensitivity of the bits to transmission path errors and protected in the
transmission path encoder 4 depending on the classes, are modulated by the modulator 7. Output bits of the modulator 7
are processed for transmission by the transmitter 8 and transmitted via antenna 10.

[0134] On the decoding device side, the bit string, as an encoded output demodulated by the antenna co-user 9,
receiver 11 and the demodulator 13, is convolution decoded by the convolution decoder 16. The decoded bit string is
sent to the CRC code comparator-frame masking unit 15, where the CRC code appended to the convolution decoding
output of the convolution decoder 16 is compared to the CRC error check code calculated from the bit group
excluding the crucial bit group, to adjust the convolution decoding output depending on the results of the comparison.
[0135] The CRC code comparator-frame masking unit 15 executes the error detection process for inspecting the
transmission error using the error check code appended to the convolution decoding output from the convolution
decoding step executed by the convolution decoder 16, and an output adjustment step of adjusting the convolution
decoding output depending on the result of error inspection at the error inspection step.

[0136] In particular, if the CRC code is not coincident, that is if a CRC error is detected, and the speech is
decoded using data of the frame in question, the speech quality is deteriorated appreciably. So, error protection
processing (frame masking) is executed depending on the degree of error concatenation.

[0137] The frame masking state of the current frame is updated depending on the CRC decoding results of class I.
Fig.19 shows the state transition by the frame masking processing. The respective states (state 0 to state 7) undergo
transition in the direction indicated by the arrows. The transition starts from state 0. On a transition line, "1" and "0"
indicate the direction of transition in case of an error-corrupted frame and the direction of transition in case of an
error-free frame, respectively.

[0138] The following parameter substitution is executed depending on the state values. In an error-free state, the
state value is 0 and the received speech frame bits are processed. The state 7 denotes the state of restoration.

[0139] If, in decoding the above LSP parameters, the state variable state is "state 1" to "state 6", the LSP
parameters are replaced by those of the previous frame.

[0140] If, in the state 7 of restoration, LSP4 = 0, indicating that the LSP mode is of the straight type, the LSP parameters
are calculated from the entire LSP codes. If LSP4 = 1, indicating that the LSP mode is of the differential type, the LSP
parameters are calculated from the following equation (5):

LSP_{base}(n) = p \cdot LSP_{prev}(n) + (1 - p) \cdot LSP_{0th}(n)  for n = 1 ... 10    (5)

such that, in the differential mode, the LSP parameters from the LSP0 code are interpolated with those of the previous
frame.

[0141] In the above equation (5), LSP_base(n) is the LSP parameter of the basic layer, LSP_prev(n) is the LSP parameter
of the previous frame and LSP_0th(n) is the parameter decoded from the LSP0 code. On the other hand, p is an
interpolation coefficient and is changed in a range from 0.7 to 0.0 in steps of 0.1 by the number of CRC error frames
directly previous of the class I. For example, if the number of past error frames is 0, p = 0.7 is used, with the LSP_base(n)
of the basic layer being 0.7 · LSP_prev(n) + 0.3 · LSP_0th(n). The LSP2, LSP3 and LSP5 codes are disregarded and
LSP_base(n) is used as the current LSP parameter.
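Equation (5) amounts to a weighted average of the previous-frame LSPs and the LSPs decoded from the LSP0 code. The sketch below assumes, as one reading of the text, that p falls linearly from 0.7 by 0.1 per preceding CRC error frame and is floored at 0.0; this mapping and the function names are assumptions made for illustration.

```python
def interpolation_coefficient(num_error_frames):
    """ASSUMED mapping: p = 0.7 - 0.1 * n, limited to the range 0.7 .. 0.0."""
    return max(0.7 - 0.1 * num_error_frames, 0.0)


def lsp_base(lsp_prev, lsp_0th, num_error_frames):
    """Equation (5): LSP_base(n) = p*LSP_prev(n) + (1 - p)*LSP_0th(n)."""
    p = interpolation_coefficient(num_error_frames)
    return [p * prev + (1.0 - p) * cur for prev, cur in zip(lsp_prev, lsp_0th)]


# Ten LSP values per frame (n = 1 ... 10); the numbers are arbitrary test data.
previous = [1.0] * 10
decoded = [0.0] * 10
no_errors = lsp_base(previous, decoded, 0)    # p = 0.7
many_errors = lsp_base(previous, decoded, 7)  # p = 0.0
print(no_errors[0], many_errors[0])
```

With no preceding error frames the result leans 70 % on the previous frame, while after a long error run it relies entirely on the newly decoded LSP0 values.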

[0142] A mute variable mute, controlling the sound volume of the output sound depending on the value of the state
variable state, is set as shown in Fig.21. For example, if the state variable state is 0, the mute variable mute is set to
1.000. If the state variable state is 6, the mute variable mute is set to 0.000. For the state variable state = 7, the mute
variable mute used is the average of 1.0 and the mute value of the previous frame. However, if the average value
exceeds 0.8, 0.8 is used.

[0143] The substitution of the voiced sound parameter and the gain control are performed as follows:
[0144] If the voiced/unvoiced decision parameter VUV is V, and the state variable state is 1 to 6, the codebook
parameters idS0, idS1, the LPC residual spectral gain codebook index idG and the spectral parameters idS0_4k to idS3_4k
for 4 kbps are replaced by those of the previous frame. For controlling the sound volume of the output speech, the
harmonic level parameters Am[00..127] of the LPC residual signals are gain-controlled as in the following equation (6):

Am[i] = mute \cdot Am_{(org)}[i]  for i = 0 ... 127    (6)

where Am(org)[i] has been calculated from the spectral parameters.

[0145] If, in the state 7, the directly previous frame on restoration to the voiced speech is unvoiced, the following
equation (7):

Am[i] = 0.6 \cdot mute \cdot Am_{(org)}[i]    (7)

is used in place of the equation (6), in order to suppress the gain to maintain continuity, in other words, to suppress the
waveform from being disturbed by the difference in the forward and aft side spectra.
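The restoration rule for the mute variable and the gain control of equations (6) and (7) can be sketched as follows. Only the state 7 rule for mute is coded, since the full table of Fig.21 is not reproduced in the text; the function and argument names are hypothetical.

```python
def mute_on_restoration(prev_mute):
    """State 7: average of 1.0 and the previous mute value, capped at 0.8."""
    return min((1.0 + prev_mute) / 2.0, 0.8)


def scale_harmonics(am_org, mute, restored_after_unvoiced=False):
    """Equation (6), or equation (7) when the frame preceding restoration
    to voiced speech was unvoiced (extra factor of 0.6)."""
    factor = 0.6 * mute if restored_after_unvoiced else mute
    return [factor * a for a in am_org]


# Arbitrary harmonic levels standing in for Am(org)[i].
am_org = [1.0, 2.0, 4.0]
mute = mute_on_restoration(0.0)          # previous frame fully muted -> 0.5
print(scale_harmonics(am_org, mute))
print(scale_harmonics(am_org, mute, restored_after_unvoiced=True))
```

The cap at 0.8 keeps the first restored frame below full volume even when the previous mute value was already high.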

[0146] Meanwhile, 4 bits each of the previous frame p and the current frame c of the zeroth LPC residual spectral
codebook index idS0 and the first LPC residual spectral codebook index idS1 are of other classes than class I and class
VI for 2 kbps, as shown in Fig.6, so that, from Fig.18, these bits are individually protected by the 1 CRC bit. If, in the
state 0 or state 7, that is during the normal state or on restoration, CRC errors of these classes for the same frame are
detected simultaneously, the quantized harmonics level parameters Amqnt[1..44] of the fixed dimension are converted
from the original Amqnt(org)[1..44] for suppressing the low frequency range level, as indicated by the following equation
(8):

Am_{qnt}[i] = s[i] \cdot Am_{qnt(org)}[i]    (8)

[0147] In this equation (8), s[i] is a coefficient for suppressing Amqnt(org)[i] and is set as shown in Fig.22.

[0148] For 4 kbps, almost all bits of the first extension LPC residual spectral code index idS1_4k, second extension
LPC residual spectral code index idS2_4k and the third extension LPC residual spectral code index idS3_4k are
protected as being class II bits by plural CRC bits. If an error is detected in the class II, the spectral envelope of the
extension layer is disregarded.

[0149] For example, if the VUV decision parameter is UV, and the state variable state = 1 to 6, the noise codebook
gain parameters idGL00, idGL01 and the noise codebook gain parameters idGL10 to idGL13 for 4 kbps are replaced by
those of the previous frame. In a similar case, the noise codebook gain parameters idGL00, idGL01 may be replaced
by idGL01 of the previous frame, whilst the noise codebook gain parameters idGL10 to idGL13 for 4 kbps may be
replaced by idGL13 of the previous frame. The noise codebook gain parameters idGL00, idGL01 and the noise
codebook gain parameters idGL10 to idGL13 for 4 kbps used are those obtained on generating uniform random numbers
within the range of respective numbers of bits.

[0150] For controlling the sound volume of the output sound, the LPC residual signal res[00..159] is gain-controlled
as shown by the following equation (9):

res[i] = mute \cdot res_{org}[i]  (0 \le i \le 159)    (9)

where res_org[i] is found from the noise codebook parameter.

[0151] The convolution decoding output, frame-masked by the CRC code comparator-frame masking unit 15, is
routed to the speech decoder 17.

[0152] Figs.23 and 24 show the configuration of the speech decoder 17. From the CRC code comparator-frame masking
unit 15, the codebook indexes corresponding to the quantized output of the LSPs (line spectrum pairs) are taken out
via terminal 202, whilst the index, pitch and the VUV parameters as envelope quantization output are taken out via
terminals 203 to 205 and the indexes as the UV (unvoiced) data are taken out via terminal 207. The CRC error signals,
obtained on the CRC check at the CRC code comparator-frame masking unit 15, are routed to an unvoiced sound synthesis
unit 220.

[0153] The index as an envelope quantization output from the terminal 203 is sent to a vector dequantizer 212 for
vector dequantization to find a spectral envelope of the LPC residuals, which is sent to a voiced sound synthesis unit 211.
The voiced sound synthesis unit 211 synthesizes LPC residuals of the voiced sound portion by sinusoidal synthesis and
is fed with the pitch and voiced/unvoiced decision output from terminals 204, 205. The LPC residuals of the voiced
sound from the voiced sound synthesis unit 211 are sent to an LPC synthesis filter 214. The index of the UV data from
the terminal 207 is sent to the unvoiced sound synthesis unit 220, where the noise codebook is referred to in order to
take out the LPC residuals as the excitation vector of the unvoiced portion. These LPC residuals are also sent to the
LPC synthesis filter 214, where the LPC residuals of the voiced portion and those of the unvoiced portion are
independently processed with LPC synthesis. Alternatively, the LPC synthesis may be performed on the sum of the LPC
residuals of the voiced portion and those of the unvoiced portion. The LSP index from the terminal 202 is sent to an LPC
parameter reproducing unit 213 to take out α-parameters of the LPC, which are sent to the LPC synthesis filter 214. The
speech signals, obtained on LPC synthesis by the LPC synthesis filter 214, are taken out at an output terminal 201.
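The independent LPC synthesis of the voiced and unvoiced residuals, followed by summation, can be pictured with a plain all-pole filter. This is a minimal sketch only: the sign convention A(z) = 1 + Σ α_k z^-k is an assumption, the post filters and LSP interpolation are left out, and the function names are hypothetical.

```python
def lpc_synthesis(excitation, alpha):
    """All-pole synthesis y[n] = e[n] - sum_k alpha[k-1] * y[n-k].

    ASSUMPTION: the alpha-parameters follow A(z) = 1 + sum_k alpha_k z^-k.
    """
    out = []
    for n, e in enumerate(excitation):
        acc = e
        for k, a in enumerate(alpha, start=1):
            if n - k >= 0:               # zero initial filter state
                acc -= a * out[n - k]
        out.append(acc)
    return out


def synthesize_frame(res_voiced, res_unvoiced, alpha_v, alpha_uv):
    """Filter the voiced and unvoiced residuals separately, then add them."""
    voiced = lpc_synthesis(res_voiced, alpha_v)
    unvoiced = lpc_synthesis(res_unvoiced, alpha_uv)
    return [v + u for v, u in zip(voiced, unvoiced)]


# Impulse response of a one-pole filter with alpha_1 = -0.5.
impulse_response = lpc_synthesis([1.0, 0.0, 0.0], [-0.5])
mixed = synthesize_frame([1.0, 0.0], [0.0, 1.0], [-0.5], [0.0])
print(impulse_response, mixed)
```

Keeping two filters with separate coefficient sets mirrors the arrangement in which voiced and unvoiced portions are synthesized independently rather than from a summed residual.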
[0154] Fig.24 shows a more specified structure of the speech decoder 17 shown in Fig.23. In the following
description, parts or components which are the same as those shown in Fig.23 are depicted by the same reference
symbols.

[0155] To the input terminal 202 is sent the vector quantization output of the LSPs via the CRC code comparator-frame
masking unit 15, that is the so-called codebook index.

[0156] The LSP index is sent to a vector dequantizer 231 of the LPC parameter reproducing unit 213, where the LSP
data are processed with vector dequantization. The resulting data are sent to LSP interpolation circuits 232, 233 for
LSP interpolation and thence sent to LSP → α converters 234, 235 for conversion to α-parameters of LPC (linear
prediction coding). These α-parameters are sent to the LPC synthesis filter 214. The LSP interpolation circuit 232 and the
LSP → α converter 234 are for voiced sound (V), whilst the LSP interpolation circuit 233 and the LSP → α converter 235
are for unvoiced sound (UV). The LPC synthesis filter 214 separates an LPC synthesis filter for voiced sound 236 from
an LPC synthesis filter for unvoiced sound 237. That is, the LPC coefficient interpolation is performed independently for
the voiced and unvoiced portions to prevent adverse effects otherwise produced due to interpolation of LSPs of totally
different properties at a transition point from voiced to unvoiced portions or from unvoiced to voiced portions.
[0157] The input terminal 203 is fed with weighting vector quantized code index data of the spectral envelope Am
via the CRC code comparator-frame masking unit 15. The input terminal 204 is fed with data of the pitch parameter PCH
via the CRC code comparator-frame masking unit 15, whilst the input terminal 205 is fed with voiced/unvoiced decision
data via the CRC code comparator-frame masking unit 15.

[0158] The vector-quantized index data of the spectral envelope Am from the input terminal 203 is sent to the vector
dequantizer 212 for vector dequantization, which is the back-conversion corresponding to the data number conversion.
The resulting data of the spectral envelope is sent to a sinusoidal synthesis circuit 215 of the voiced sound synthesis
unit 211.

[0159] If a frame-to-frame difference is taken prior to vector quantization in encoding, the decoding of the
frame-to-frame difference is performed after the vector dequantization, followed by data number conversion, to produce
spectral envelope data.

[0160] The sinusoidal synthesis circuit 215 is fed with the pitch from the input terminal 204 and with the V/UV
decision data from the input terminal 205. From the sinusoidal synthesis circuit 215, LPC residual data, corresponding to
the output of the LPC back-filter 111 of Figs.2 and 3, are taken out and sent to an adder 218. The particular technique
of this sinusoidal synthesis is disclosed in Japanese Laying-Open Patent H-5-266487 and in Japanese Laying-Open
Patent H-8-063197 filed in the name of the present Assignee.

[0161] The envelope data from the vector dequantizer 212 and the pitch and V/UV decision data from the input
terminals 204, 205 are routed to a noise synthesis circuit 216 for adding the noise of the voiced (V) portion. An output of
the noise synthesis circuit 216 is sent to the adder 218 via a weighted addition circuit 217. The reason for doing
this is as follows. The excitation produced by sinusoidal synthesis as an input to the LPC synthesis filter of the voiced
sound gives a stuffed feeling in low-pitch sound such as the male voice, and the sound quality changes suddenly between
the voiced (V) and the unvoiced (UV) sound to give an unnatural feeling. Therefore, noise which takes into account the
parameters derived from the encoded speech data, such as pitch, spectral envelope amplitude, maximum amplitude in a
frame or the level of the residual signal, is added to the voiced portion of the LPC residual signals, that is to the
excitation of the voiced portion of the LPC synthesis filter input.

[0162] The sum output of the adder 218 is sent to a synthesis filter 236 for voiced speech of the LPC synthesis
filter 214 to undergo LPC synthesis processing to produce a time domain waveform signal, which then is filtered
by a post filter for voiced speech 238v and thence is routed to an adder 239.
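
The voiced path described above (weighted noise addition followed by LPC synthesis) can be illustrated with a
minimal sketch. It assumes a direct-form all-pole filter 1/A(z) with A(z) = 1 + a1*z^-1 + ... + ap*z^-p. The
function names, the noise weight and the coefficient values are hypothetical and are not taken from the
specification; the post filter 238v is omitted.

```python
def weighted_add(sinusoidal, noise, noise_weight):
    """Adder 218 in sketch form: sinusoidal residual plus weighted noise.

    noise_weight stands in for the weighted addition circuit 217 (hypothetical value).
    """
    return [s + noise_weight * n for s, n in zip(sinusoidal, noise)]


def lpc_synthesis(excitation, lpc_coeffs):
    """All-pole synthesis: y[n] = x[n] - sum_k a_k * y[n-k]."""
    out = []
    for n, x in enumerate(excitation):
        y = x
        for k, a in enumerate(lpc_coeffs, start=1):
            if n - k >= 0:
                y -= a * out[n - k]
        out.append(y)
    return out


# A unit impulse with no added noise, passed through a one-pole filter (a1 = -0.5).
excitation = weighted_add([1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0], 0.5)
voiced = lpc_synthesis(excitation, [-0.5])
```

With these illustrative values the output is the decaying impulse response of the one-pole filter.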

[0163] The shape index and the gain index, as UV data, are routed respectively to input terminals 207s and 207g,
as shown in Fig.24, and are thence supplied to the unvoiced sound synthesis unit 220. The shape index from the
terminal 207s is sent to a noise codebook 221 of the unvoiced sound synthesis unit 220, whilst the gain index
from the terminal 207g is sent to a gain circuit 222. A representative value output read out from the noise
codebook 221 is the noise signal component corresponding to the LPC residuals of the unvoiced sound and is of a
pre-set gain amplitude in the gain circuit 222. The noise signal component then is sent to a windowing circuit
223 where it is windowed to smooth out the junction to the voiced sound portion.
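
The unvoiced path above (codebook lookup, gain, windowing) can be sketched as follows. This is only an
illustration under stated assumptions: the codebook entries, the gain table and the window shape are
hypothetical placeholders, not values from the specification.

```python
def unvoiced_excitation(shape_index, gain_index, noise_codebook, gain_table, window):
    """Noise codebook 221 -> gain circuit 222 -> windowing circuit 223, in sketch form."""
    shape = noise_codebook[shape_index]   # representative noise vector
    gain = gain_table[gain_index]         # amplitude selected by the gain index
    return [gain * s * w for s, w in zip(shape, window)]


# Hypothetical tables, for illustration only.
NOISE_CODEBOOK = [[1.0, -1.0, 1.0, -1.0], [0.5, 0.5, -0.5, -0.5]]
GAIN_TABLE = [0.0, 2.0]
WINDOW = [0.5, 1.0, 1.0, 0.5]  # tapered edges smooth the junction to voiced frames

frame = unvoiced_excitation(1, 1, NOISE_CODEBOOK, GAIN_TABLE, WINDOW)
```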

[0164] An output of the windowing circuit 223, as an output of the unvoiced sound synthesis unit 220, is routed
to a synthesis filter for unvoiced (UV) sound 237 where it is processed with LPC synthesis to provide time
waveform data for the unvoiced portion, which is filtered by the post filter for unvoiced portion 238u and is
thence supplied to the adder 239.

[0165] In the adder 239, the time waveform signal of the voiced portion from the post filter for voiced portion
238v is summed to the time waveform data of the unvoiced portion from the post filter for unvoiced portion 238u.
The resulting data is taken out at an output terminal 201.

[0166] The decoded speech output, taken out at this output terminal 201, is converted by the D/A converter 18
into analog signals, which are issued as speech via speaker 19.
[0167] Figs.25 and 26 show a flowchart for collectively showing the speech encoding method and the speech
decoding method performed by the portable telephone device.

[0168] Specifically, the speech encoding method performed by the encoding device of the portable telephone
device is the processing of encoding an input speech signal by a speech encoding process of step S1, selecting a
crucial bit set among the plural sorts of the speech encoding parameters from step S1, having high psychoacoustic
sensitivity with respect to the transmission path error, calculating a CRC code from the crucial bit set at step
S2, and convolution coding the CRC code calculated at step S2 together with the crucial bit set at step S3.
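
The encoder-side protection described above can be sketched as a CRC computed over the crucial bits followed by
convolution coding of the crucial bits and the CRC. The sketch makes several assumptions that are not stated in
this section: a 3-bit CRC with polynomial x^3 + x + 1, a rate-1/2 convolutional code with constraint length 3
and generators 7 and 5 (octal), and zero tail bits. All names and parameters are hypothetical.

```python
CRC_POLY = 0b011   # x^3 + x + 1 without its leading term (hypothetical choice)
CRC_WIDTH = 3


def crc_bits(bits, poly=CRC_POLY, width=CRC_WIDTH):
    """Bitwise CRC over a list of bits, zero initial register, MSB first."""
    reg = 0
    mask = (1 << width) - 1
    for b in bits:
        top = (reg >> (width - 1)) & 1
        reg = (reg << 1) & mask
        if top ^ b:
            reg ^= poly
    return [(reg >> i) & 1 for i in range(width - 1, -1, -1)]


def conv_encode(bits, generators=(0b111, 0b101), constraint=3):
    """Rate-1/2 convolutional encoder; tail zeros flush the shift register."""
    state = 0
    out = []
    for b in list(bits) + [0] * (constraint - 1):
        state = ((state << 1) | b) & ((1 << constraint) - 1)
        for g in generators:
            out.append(bin(state & g).count("1") & 1)  # parity of the tapped bits
    return out


def protect_crucial_bits(crucial):
    """Append the CRC to the crucial bit set, then convolution code the result."""
    return conv_encode(crucial + crc_bits(crucial))


coded = protect_crucial_bits([1, 0, 1])
```

On the receiving side, recomputing the CRC over the decoded crucial bits plus the received CRC yields an
all-zero remainder when no error is present, which is one common way to carry out the comparison.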

[0169] The speech decoding processing executed by the decoding device of the portable telephone device is the
processing of convolution decoding the convolution coding output from another portable telephone device at step
S11 and detecting errors by CRC code comparison at step S12 for classes having CRC protective bits. At step S13,
the frame masking state variable state is updated responsive to the CRC code comparison.

[0170] If no error has been detected at step S13, the program moves to step S16. If the variable is updated to
state = 7, the program moves to step S17 to verify at step S18 whether or not the variable mute has exceeded 0.8
as a result of averaging the variable mute (p) of the entire frames and [...]. If the variable has not exceeded
0.8, the program moves to step S19 to set mute = 0.8.
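
The handling of the variable mute when state = 7 can be sketched as an averaging followed by a lower bound of
0.8. The operands of the averaging are not clear from the text, so the sketch assumes, purely for illustration,
the mean of the previous-frame value and the current value. The function and argument names are hypothetical.

```python
MUTE_FLOOR = 0.8  # threshold named in the paragraph above


def mute_on_recovery(state, mute_prev, mute_now):
    """Average the mute values and keep the result at 0.8 or above when state == 7."""
    if state != 7:
        return mute_now
    averaged = (mute_prev + mute_now) / 2.0  # assumed operands of the averaging
    return averaged if averaged > MUTE_FLOOR else MUTE_FLOOR
```

For example, previous and current values of 0.4 and 0.6 average to 0.5, which does not exceed 0.8, so the
function returns 0.8.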

[0171] If it is verified at step S20 that LSP = 1, with the mode being the differential mode, the program moves
to step S21 to set an interpolation coefficient responsive to the number of errors of the directly previous class
I. LSP decoding is done at step S22. The components LSP2, LSP3 and LSP5 corresponding to error components are not
summed. If LSP is not 1 at step S20, the program moves to step S23 to perform routine LSP decoding.
[0172] If the state variable state is 0 or 7 and if the frame is the voiced frame (VUV > 0) at step S24, the
program moves to step S25 to decode the LPC residual spectrum Am by idS0, idS1 and idG. If errors are detected at
step S26 simultaneously from the CRC bits individually added to idS0 and idS1, low range suppression is applied
at step S27.
[0173] Meanwhile, since Am has been converted to the fixed dimension, dimension conversion is effected at step
S28 for restoration to the pitch-based inherent dimension.

[0174] If the rate is 4 kbps at step S29, the program moves to step S30 where the CRC bits covering idS2_4k,
idS3_4k and idS4_4k as extension components of Am are checked. The extension components are summed at step S31
only when no error has been detected.

[0175] If state = 7 at step S32, and prevVUV (VUV of the previous frame) = 0, indicating unvoiced sound, the
program moves to step S33 to suppress Am.
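
The spectrum-related concealment of steps S24 to S33 can be summarized in a minimal sketch. It follows the order
of the paragraphs above and omits the decoding of Am itself and the dimension conversion of step S28. The number
of low-range components, the attenuation factors and all argument names are hypothetical; the text does not give
them.

```python
def conceal_spectrum(am, ext, state, vuv, prev_vuv, err_s0, err_s1,
                     rate_4k, ext_crc_error,
                     low_bins=2, low_gain=0.5, suppress_gain=0.5):
    """Apply the frame masking rules to an already decoded envelope `am`."""
    out = list(am)
    # S24-S27: voiced frame in state 0 or 7; suppress the low range only when
    # the CRC checks of both idS0 and idS1 fail simultaneously.
    if state in (0, 7) and vuv > 0 and err_s0 and err_s1:
        for i in range(min(low_bins, len(out))):
            out[i] *= low_gain
    # S29-S31: at 4 kbps, sum the extension components only if their CRC passed.
    if rate_4k and not ext_crc_error:
        out = [a + e for a, e in zip(out, ext)]
    # S32-S33: first good frame after errors, previous frame unvoiced.
    if state == 7 and prev_vuv == 0:
        out = [a * suppress_gain for a in out]
    return out


am = [4.0, 4.0, 4.0, 4.0]
ext = [1.0, 1.0, 1.0, 1.0]
both_stages_hit = conceal_spectrum(am, ext, 0, 1, 1, True, True, True, False)
recovery_frame = conceal_spectrum(am, ext, 7, 1, 0, False, False, False, False)
```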

[0176] The portable telephone device having the transmission path encoder 4 and the encoder 3 as encoding device
on the transmitting side is able to output encoding data strong against transmission path errors.

[0177] Also, the portable telephone device having the transmission path decoder and the speech decoder embodying
the decoding method and apparatus of the present invention on the receiving side is able to suppress the lowering
of the sound quality otherwise caused by the transmission path errors.

[0178] The portable telephone device having the decoding device and method embodying the present invention has
been explained above. However, the present invention is not limited to a decoding device of the portable
telephone device but is applicable to e.g., a transmission system.
[0179] Fig.27 shows an illustrative structure of an embodiment of a transmission system embodying the present
invention. Meanwhile, the system means a logical assembly of plural devices, without regard to whether or not the
respective devices are in the same casing.
[0180] In this transmission system, the decoding device is owned by a client terminal 63, whilst the encoding
device is owned by a server 61. The client terminal 63 and the server 61 are interconnected over a network 62,
e.g., the Internet, ISDN (Integrated Service Digital Network), LAN (Local Area Network) or PSTN (Public Switched
Telephone Network).

[0181] If a request for audio signals, such as musical numbers, is made from the client terminal 63 to the server
61 over the network 62, the encoding parameters of audio signals corresponding to requested musical numbers are
protected responsive to psychoacoustic sensitivity of bits against transmission path errors on the network 62 and
transmitted to the client terminal 63, which then decodes the encoding parameters protected against the
transmission path errors from the server 61 responsive to the decoding method to output the decoded signal as
speech from an output device, such as a speaker.

[0182] Fig.28 shows an illustrative hardware structure of the server 61 of Fig.27.

[0183] A ROM (read-only memory) 71 has stored therein e.g., IPL (Initial Program Loading) program. The CPU
(central processing unit) 72 executes an OS (operating system) program, in accordance with the IPL program stored
in the ROM 71. Under the OS control, a pre-set application program stored in an external storage device 76 is
executed to perform the encoding processing of audio signals, to protect the encoding data obtained on encoding,
and to perform transmission processing of the encoding data to the client terminal 63. A RAM (random access
memory) 73 memorizes programs or data required for operation of the CPU 72. An input device 74 is made up e.g.,
of a keyboard, a mouse, a microphone or an external interface, and is acted upon when inputting necessary data or
commands. The input device 74 is also adapted to operate as an interface for accepting inputs from outside of
digital audio signals furnished to the client terminal 63. An output device 75 is constituted by e.g., a display,
a speaker or a printer, and displays and outputs the necessary information. An external memory 76 comprises e.g.,
a hard disc having stored therein the above-mentioned OS or the pre-set application program. A communication
device 77 performs control necessary for communication over the network 62.

[0184] The pre-set application program stored in the external memory 76 is a program for causing the functions of 
the speech encoder 3, transmission path encoder 4 or the modulator 7 to be executed by the CPU 72. 

[0185] Fig.29 shows an illustrative hardware structure of the client terminal 63 shown in Fig.27.

[0186] The client terminal 63 is made up of a ROM 81 to a communication device 87 and is basically configured
similarly to the server 61 constituted by the ROM 71 to the communication device 77.

[0187] It is noted that an external memory 86 has stored therein a program, as an application program, for
executing the decoding method of the present invention for decoding the encoded data from the server 61 or a
program for performing other processing as will now be explained. By execution of these application programs, the
CPU 82 decodes or reproduces the encoded data protected against transmission path errors.

[0188] Specifically, the external memory 86 has stored therein an application program which causes the CPU 82 to
execute the functions of the demodulator 13, the transmission path decoder [...]

[0189] Thus, the client terminal 63 is able to realize the decoding method stored in the external memory 86 as
software without requiring the hardware structure shown in [...] 1.

[0190] It is also possible for the client terminal 63 to store the encoding data transmitted from the server 61
to the external storage 86 and to read out the encoded data at a desired time to execute the decoding method to
output the speech at a desired time. The encoded data may also be stored in another external memory, such as a
magneto-optical disc or other recording medium.

[0191] Moreover, as the external memory 76 of the server 61, recordable mediums, such as magneto-optical disc or
magnetic recording medium, may be used to record the encoded data on these recording mediums.
[0192] Figs.28 and 29 show the logical arrangement of the various elements which are physically arranged as
suitable for the circumstances of use.

Claims

1. A decoding apparatus for decoding transmitted encoded parameters corresponding to input speech signals which
have been encoded by an encoding device from one pre-set encoding unit on the time axis to another and which
have been classed depending on the psychoacoustic sensitivity of bits to transmission path errors, the decoding
apparatus comprising:

transmission path decoding means for detecting an error using an error check code appended to said encoding
parameters of a pre-set class and providing for different frame masking processing depending on errors in the
encoding parameters produced in said encoding unit.

2. A decoding apparatus according to claim 1, wherein the encoding processing in said encoding device is performed
using a plurality of vector quantizers, an index representing each code vector is classified according to
psychoacoustic sensitivity of bits against transmission path errors and is transmitted in this classified state,
said transmission path decoding means detecting the error check codes appended to said index of a pre-set class
to provide for different frame masking processing depending on an index error produced in said encoding unit.

3. A decoding apparatus according to claim 2, wherein the indexes transmitted from said encoding device are
divided into a class which is convolution coded following appendage of an error check code responsive to
psychoacoustic sensitivity of the bits against the transmission path error, a class in which simply an error
check code is appended and a class in which no processing is made.

4. A decoding apparatus according to claim 2 or 3, wherein said transmission path decoding means performs frame
masking processing if errors are found in more than a pre-set number of encoding units, and if otherwise, the
indexes in error are straightly used.

5. A decoding apparatus according to any one of the preceding claims, wherein the encoding processing in the
encoding device is performed using a multi-stage vector quantizer prepared by alternate learning, and wherein an
index representing each code vector is classified according to psychoacoustic sensitivity of bits against
transmission path errors and is transmitted in this classified state, said transmission path decoding means
detecting the error check codes appended to said index of a pre-set class to provide for different frame masking
processing depending on whether or not an error has been detected in plural indexes.

6. A decoding apparatus according to claim 5 wherein the encoding processing in the encoding device is the
encoding processing of quantizing and transmitting the spectral information of the speech by a two-stage vector
quantizer by alternate learning, and wherein said transmission path decoding means performs frame masking
processing suppressing the low range when indexes representing code vectors outputted by two vector quantizers
are in error simultaneously.

7. A decoding apparatus according to any one of the preceding claims, wherein the transmission path decoding
means performs frame masking processing of suppressing the signal level if, after errors are detected in one or
more consecutive frames as pre-set encoding units on the time axis, a first regular frame is received, the first
regular frame is voiced and the last regular frame after the error occurrence is unvoiced.

8. A decoding method for decoding transmitted encoded parameters corresponding to input speech signals which are
previously encoded by an encoding device from one pre-set encoding unit on the time axis to another and which
are previously classed depending on the psychoacoustic sensitivity of bits to transmission path errors, said
decoding method comprising:

a transmission path decoding step of detecting an error using an error check code appended to said encoding
parameters of a pre-set class and providing for different frame masking processing depending on errors in the
encoding parameters produced in said encoding unit.

9. A decoding method according to claim 8, wherein the encoding processing in said encoding device is performed
using a plurality of vector quantizers, an index representing each code vector is classified according to
psychoacoustic sensitivity of bits against transmission path errors and is transmitted in this classified state,
said transmission path decoding step detecting the error check codes appended to said index of a pre-set class
to provide for different frame masking processing depending on an index error produced in said encoding unit.

10. A decoding method according to claim 8 or 9, wherein the encoding processing in the encoding device is
performed using a multi-stage vector quantizer prepared by alternate learning, and wherein an index representing
each code vector is classified according to psychoacoustic sensitivity of bits against transmission path errors
and is transmitted in this classified state, said transmission path decoding means detecting the error check
codes appended to said index of a pre-set class to provide for different frame masking processing depending on
whether or not an error has been detected in plural indexes.

11. A computer program which is capable, when executed by a computer processing means, of causing the computer
processing means to perform a method according to any one of claims 8 to 10.

12. A recording medium storing a computer program according to claim 11.

PARAMETERS   MEANING                 NUMBER OF BITS
LSP0         LSP index 0             (5 bit)
LSP2         LSP index 2             (7 bit)
LSP3         LSP index 3             (5 bit)
LSP4         LSP index 4             (1 bit)
VUV          voiced/unvoiced flag    (2 bit)
PCH          pitch parameter         (7 bit)
idS0